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ABSTRACT 


This thesis is concerned with the analysis of security investigation data extracted 
from the investigative files of 564 U.S. Navy first-term enlisted personnel who came on 
active duty between 1979 and 1982. The individuals had all completed their first term 
of service and had either completed service satisfactorily or had been released early with 
an adverse discharge. The data was selected from six character-of-service categories: 
good, homosexual, drug/alcohol abuse, misconduct, court martial, and character and 
behavior disorders. The purpose of the thesis was to investigate optimal ways to con- 
figure a large, categorical data base and to look for and quantify relationships between 
investigative data and final disposition of service. Several noteworthy relationships were 
found between derogatory information developed in the investigation and the subse- 
quent character-of-service. Further avenues of investigation using this data are 


suggested. 
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I. INTRODUCTION 


A. BACKGROUND 

The importance of protecting sensitive military information and operations from 
potentially hostile sources is a concept as old as warfare itself. Events of the recent past 
indicate that the nation must never grow complacent about its ability to safeguard clas- 
sified information. World-wide defense commitments, the ideological and historic dif- 
ferences existing between the US and other nations, and the huge number of people who 
frequently access, create, analyze and service the vast amount of sensitive information 
combine to create a tremendous managerial problem: Who can be trusted with access 
to the nation’s security secrets? 

The need to investigate the backgrounds of those people needing access to classified 
information has been a fixture of the national security establishment for many years. 
Typically, an individual, by virtue of his duty responsibilities, is determined to need reg- 
ular access to sensitive information of some level (secret, top secret, sensitive compart- 
mentalized information. etc). A fairly standard administrative procedure is employed 
throughout the Department of Defense (DOD) in order to determine whether the person 


should be allowed access to classified information, 


B. THE SECURITY INVESTIGATION PROCEDURE 

The first element in a security investigation is the completion of a detailed form 
named the Statement of Personal History (SPH). The SPH requires specific information 
about a person’s past. Information such as a list of close familv members, foreign travel, 
arrests and convictions, schools attended, jobs held, creditors, and personal references 
are all required. The SPH is the starting point for any security investigation. 

The next step in the investigation consists of the National Agency Check (NAC) and 
the Local Agency Check (LAC). Law enforcement agencies, both local (.e., city or state 
police) and national (1.e., the FBI) are queried about outstanding warrants and records 
of arrests. A check of credit information 1s also conducted with national and local credit 
bureaus to determine whether an individual has money problems. 

The clearance will normally be granted to a person who requires access to informa- 
tion with a classification of Secret or lower when the above procedure does not turn up 


any inconsistencies. 


A person requiring access to top secret or higher level information will undergo a 
much more detailed investigation: a background investigation (BI), or a special 
background investigation (SBI). These investigations are much more thorough than 
those for lesser clearances and involve actual interviews with people who know and have 
developed a relationship with the individual being investigated. Neighbors, friends, 
school officials, former emplovers and others may be interviewed. If the answers are 
consistent and positive, the subsequent investigation will be much less detailed than if a 
negative trend develops and other sources of information are “developed” by the inves- 
tigators. If information 1s developed which contradicts that listed on the Statement of 
Personal History or 1s conspicuously absent from it, the subject will almost certainly 
be interviewed. In certain other types of investigations, an interview is always required. 

The result of this investigation is a dossier containing basic biographical data, de- 
rogatorv information obtained from the SPH and other sources (or lack of such infor- 
mation) and recommendations as to the trustworthiness of the subject of the 
investigation. Derogatory information varies from traffic infractions to emotional 
problems to felonies. All the investigative data 1s gathered for the clearance determi- 
nation. An adjudictor reads the investigation file and makes the judgement as to the 
award of the clearance. 

The last step in the security investigation process 1s a review of the information ob- 
tained and determination of whether the clearance should be granted. 

Review of the information is performed in accordance with Adjudication guidelines 
contained in the DOD Personal Security Regulation, DOD 5200.2-R, dated January, 
1987. The factors which can disqualify an individual for a clearance are listed as well 
as the mitigating factors which might allow a clearance to be granted even though a 
disqualifving factors are present in the information. For example, a person might admit 
to experimental use of marijuana (less than six instances of use) in their adolescence. 
This use of cannabis (inarijuana or its derivatives) is considered a disqualifying factor. 
A mitigating factor in this instance 1s that the experimental abuse occurred more than 
six months ago, and the individual has no intention of using cannabis or other drugs in 
tine ft une eee 

The final determination of clearance for an individual whose record contains dis- 
qualifying information is a subjective one. It is based upon the merits of the case, and 
the evaluation of the adjudicator as to the mitigating factors which hopefully indicate 


the actual rehability of the individual in the future. 


C. BACKGROUND OF THE SPECIAL BACKGROUND INVESTIGATION DATA 
BASE (SBID) 

It is apparent that the investigation procedure must generate a tremendous amount 
of data about every person who is investigated for a security clearance. It is clear that 
we do not wish to trust national security information to those who are untrustworthy 
enough to violate laws, regulations, and accepted standards of conduct. Could this data 
be used to examine whether data obtained from the security investigations Were in any 
way related to the future service record of those investigated? Could this data provide 
insight into the investigation process, allowing investigative resources to be more effi- 
ciently allocated? 

The Defense Personal Security Research and Education Center (PERSEREC) in 
Monterey, California was directed to examune a large sample of data produced from se- 
curity investigations of first-term enlistees entering the Navy during the years 1979 - 
1982. The purpose of the study was to develop insight about the information developed 
In security investigations, especially when the final disposition of service of investigative 
subjects was known. 


The individuals whose records were involved in the study: 
1. Had background investigations initiated within three months of enlistment; 


2. Were separated or discharged during, or upon completion of their initial tour of 
duty; 


3. Were discharged for homosexuality, misconduct, drug abuse, court martial, char- 
acter and behavior disorder, or normal completion of enlistment. 


Thus, in the data base, there are five types of unsuitability discharge categories and 
One control group of personnel who successfully completed their term of service. 

Seven-hundred records were selected randomly (based upon the last digit of the so- 
cial security number) for the study. One-hundred cases were selected from each of the 
five unsuitability discharge groups and two-hundred cases in which the individuals were 
normally separated. The number of cases which were eventually included in the study 
numbered 564 because those cases where the Background Investigation was cancelled for 
any reason were removed. 

The number of records chosen in each category were not in relation to the charac- 
ter-of-service category's proportion in the actual population. An immense number of 
records would need to be drawn as a single sample in order to get a large enough rep- 


resentation from each adverse discharge category. As an illustration, consider that there 


are 73 records in this data base from the court martial character of service category. 
Persons who are investigated receive this adverse character-of-service designation ap- 
proximately 0.18% of the time. Simple arithmetic indicates that to get approximately 
73 records in this category from a single sample from the investigation population at 
large would require a sample size of nearly 41,000. It seems obvious that this is not 
reasonable. Table | displays the approximate percentages of those initially investigated 
who receive each of the six character-of-service designations discussed in this thesis (Ref. 


2]. There are other designations which are not considered here. 


Table 1. CHARACTER OF SERVICE CATEGORY PROPORTIONS 





Character of Service Category % of Investigation Population Receiving 
Category 


Homosexual 0.92% 









Misconduct 26 


Drug’Alcohol Abuse 1.8% 


Court Martial 
Character Behavior Disorder 





The data base was created by taking the investigation information from microfiche 
and entering it into a Lotus 123 spreadsheet. There were 93 possible entries for each of 
the 564 records resulting in a total data base with the potential for approximately 52,500 
data points. 

The data was essentially categorical in nature with an individual record containing 
personal information ranging from date of birth and military specialty to findings from 
high school to type of discharge. A four-digit code representing the type of derogatory 
information was the prime means of listing this data and allowed standardization across 
the data base. Other codes were created to represent other pieces of information such 
as the recommendations obtained at the various sources (high schools, colleges, neigh- 
borhoods, etc.), race or marital status. 

Problems with the size of the data base, the slow response of an AT-style micro- 
computer when dealing with such a large data set, and the limitations of Lotus 123 in 


performing statistical functions allowed only a cursory analysis of the data base as 


originally implemented. Clearly another approach was necessary to analyze and obtain 


insights from this data. 


D. PURPOSE 

The purpose of this thesis is two-fold: to investigate some available methods for 
organizing and analvzing a large, categorical data base; and to use statistical and data- 
analytical techniques to evaluate the personal security data detailed above in order to 
develop insights and correlations between the security investigation data and the subse- 


quent disposition of the subject’s term of enlistment. 


E. LIMITATIONS 

The data used in this paper was analyzed as provided. It was not possible to ensure 
actual random selection of the data, however we assume that each sample was selected 
randomly. The data was selected in an arbitrary manner (one-hundred records from 
each of the unsuitability discharge categories and two-hundred records with normal 
completion of service). It may be difficult to apply the results of this investigation to the 


general population. 


F. ANALYTICAL TOOLS USED 

The data was initially reduced and documented using the Statgraphics (version 2.6) 
Statistical software package on a Compaq 286 portable personal computer with two 
megabytes of additional random access memory (RAM). After reduction it was trans- 
ferred to an IBM 3033 System 370 mainframe computer using the MVS batch system. 
On the mainframe computer, Grafstat, an unreleased IBM mainframe data analysis and 
Statistical package was used. In addition, APL programs for categorical analvsis were 


Written using APL Graphpak to supplement the routines available in Grafstat. 


G. ORGANIZATION OF THESIS 

Following this introduction, the data reduction techniques used for this thesis and 
the lessons learned from that effort are discussed in Chapter I]. The main body of the 
thesis is contained in the Chapter III and deals with the data operations and the analysis 
conducted. Chapter IV discusses some promising areas for further analysis which were 
only briefly pursued because of time constraints. The closing chapter will summarize the 
results of this research, set forth the conclusions drawn from those results and provide 


recommendations for future research involving this data. 


Il. DATA REDUCTION 


A. GENERAL 

PERSEREC experienced problems in attempting to analyze a data base of this 
magnitude. This led them to investigate other methods of configuring the data in order 
to perform the analysis they felt was necessary. Subsequently, the Lotus 123 files were 
exported to the mainframe computer and configured into Conversational Monitoring 
System (CMS) ASCII files. The categorical nature of the data and its overwhelming size 
dictated that documentation and verification of the data base was necessary before any 
further useful analysis could be performed. However, the data editors available in CMS 
on the mainframe computer did not offer the ability to easily operate on column fields 
and did not have the flexibility needed to simultaneously document the work performed 


as it proceeded. 


B. DATA EDITING 

Statgraphics (version 2.6) offered a user-friendly data editor offering the requisite 
capabilities. Unfortunately, it was available onlv on a personal computer. A Compaq 
286 portable AT-compatible micro-computer with two megabytes of additional memory 
(useable as a virtual disk) was used. It proved extremely useful; however, its size limited 
the amount of data which could be operated upon without exceeding the memory limi- 
tations of the computer (these memory restrictions will be alleviated in the future when 
using the new 80386 based machines). 

The CMS files were transferred into micro-computer ASCII files and then stored 
on floppy disks and subsequently read into six Statgraphics (ASF) files. Each of the files 
consisted of approximately 15 of the variable entries for each of the 564 records (ap- 
proximately 8400 data points). At any one time six or seven of these variables could be 
operated upon within the data editor. 

A general procedure wasefollowed in formatting and verifying each of the six files. 
First, the file was checked to insure that the data, as it existed on the CMS files, had 
been transferred correctly. In one instance half of the field of one variable was truncated 
and had to be reconstructed. 

Next, the numeric coding used for each column was researched and ambiguities re- 


solved by recoding or removal. This step required considerable research into the coding 


methods and the investigation process in order to understand, and, if necessary, change 
the numeric codes for the sake of clarity. 

Finally, a frequency tabulation of each column was performed and labels were cre- 
ated which corresponded to the coded values. These labels were especially useful later 
in the analysis when cross-tabulations between variables vectors were conducted. 

The procedure discussed above was iterative as sometimes several interpretations 
resulted before one was confirmed as correct. Documentation of the data base was 
conducted throughout these three steps. The list of the variables contained in the data 
base, their purpose and their tvpes are contained in Figure 1 through Figure 3. These 
figures are a direct copy of the file management screen that appears in Statgraphics as 
vou enter the full-screen editor or view the data directory. Comments are limited to 21 


characters for each variable. 


VARIABLE WIDTH TYPE RANK LENGTH DATE TIME SCOuNeENT 


3/18/88 
2/26/88 
3/18/88 


RECORD NO. (RANDOM) 
SEX (MALE OR FEMALE) 
BIRTHDATE 


5 
3 
8 
8 
8 
3 
4 
3 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
4 


ee ee ee ee ee ee ee cee Oe ee oe 


ee ee 


3/18/88 
3/18/88 
2/26/88 
2/26/88 
2/26/88 
2/26/88 
2/26/88 
2/26/88 
2/26/88 
2/26/88 
2/26/88 
2/26/88 
2/26/88 
2/26/88 
2/26/88 
2/26/88 
2/26/88 
2/26/88 


Figure 1. List of Variables Contained in the Data Base: 


graphics Data Management Screen. 


DATE OF ENTNAC 

BI REQUEST DATE 
REASON FOR BI 
OCCUPATION CODE 
REASON FOR INTERVIEW 
PNTERVIEW INFO =- 1. 
TNGERVIEW INFO. e 2. 
PERV IEW INFO: =03. 
INTERVIEW INFO - 4. 
FBI/DCII FINDINGS1 
FBI/DCII FINDINGS2 
LOCAL AGENCY CHECK 
LOCAL AGENCY CHECK 
LOCAL AGENCY CHECK 
LOCAL AGENCY CHECK 
CREDIT BUREAU CHECK 
CREDIT BUREAU CHECK 
H S - # OF SOURCES 





Extracted from the Stat- 


VARIABLE WIDTH TYPE RANK LENGTH DATE TIMES COMMENDS 


3/ 4/88 14: SCHOOL RECOMM. 
3/ 4/88 14: SCHOOL RECOMM. 
3/ 4/88 14: SCHOOL RECOMM. 
2/26/88 15: SCHOOL FINDINGS 
2/26/88 16: SCHOOL FINDINGS 
2/26/88 16: SCHOOL FINDINGS 
2/26/88 16: SCHOOL FINDINGS 
2/26/88 10: .- 4 OF SOURCES 
2/26/88 11: , RECOMMENDATION 
2/26/88 11:00 COLLEGE FINDINGS 
2/26/88 11:00 EMPL. # OF SOURCES 
2/26/88 10:53 CO-WORKER # SOURCES 
4/88 11:28 EMPLOYMENT RECOMM. 
4/88 11:15 EMPLOYMENT RECOMM. 
4/88 11:15 EMPLOYMENT RECOMM. 
4/88 13:36 EMPLOYMENT FINDINGS 
4/88 13:36 EMPLOYMENT FINDINGS 
4/88 13:36 EMPLOYMENT FINDINGS 
4/88 13:36 EMPLOYMENT FINDINGS 
2/26/88 10:54 NEIGH. # OF SOURCES 
4/88 12:02 SPH NEIGH. RECOMM. 
4/88 12:02 DEV. NEIGH. REC. 
4/88 12:02 DEV. NEIGH. REC. 
4/88 14:06 NEIGH. FINDINGS 
4/88 14:06 NEIGH. FINDINGS 
4/88 14:06 NEIGH. FINDINGS 
2/26/88 10:56 # OF OTHER SOURCES 
3/ 4/88 15:17 OTHER RECOMM. 
3/ 4/88 15:17 OTHER RECOMM. 
3/11/88 08:29 OTHER FINDINGS 
3/11/88 08:29 OTHER FINDINGS 
3/11/88 08:29 OTHER FINDINGS 
3/11/88 08:29 OTHER FINDINGS 
2/26/88 10:46 RACE 


ee ee ae 
PREP PP EPP Pe Pe Pe Pe PPP PPP ee Be PP ee Pe PP Pe 


5 
6 
6 
) 
5 
5 
, 
5 
4 
5 
3 
5 
3 
6 
6 
b) 
5 
5 
> 
2 
3 
3 
6 
5 
5 
5 
3 
3 
6 
5 
5 
5 
= 
Z 





Figure 2. List of Variables Contained in the Data Base (Continued): Extracted 


from the Statgraphics Data Management Screen. 


VARIABLE WIDTH TYPE RANK LENGTH DATE TIME... COMMENT 

AG oer XA iL 564 2/26/88 10:48 MARITAL STATUS 

AJ 2 mill i, 564 2/26/88 10:48 DEPENDENTS 

AN 6) eel dl 564 2/26/88 10:48 # OF SIBLINGS 

AO asi | 1 564 2/26/88 10:48 PERMANENT RESIDENCE 
AQ jean 1 564 3/18/88 14:17 ENLISTMENT DATE 

AR Sy 1 564 2/26/88 10:48 AGE AT ENLISTMENT 

AS 4 | i! 564 2720/88 10:49 MONTHS HS TO” ENLIST 
AT 4 |] 1 564 2/26/88 10249) JOBS Hs TO ENLIST 
AU 5 al Ik 564 2/26/88 VWOrao 7 MONTHS UNEMPL. 

AV eal al 564 2/26/88 10:49 # MONTHS COLLEGE 

AW 2a ue 564 2/20/ Cem ror 420, UNEMPIie ER IOR ENL 
AX1 Dig ol 1 564 3/11/88 12:54 UNFAV. INFO. ON SPH 
AX2 Eye Jl 1 564 3/11/88 12:54 UNFAV. INFO. ON SPH 
AX3 5) ll 1 564 3/11/88 12:54 UNFAV. INFO. ON SPH 
AX4 Deal 1 564 Sy) 68 she UNEAY. INFOWON SPH 
AY1 De orl 1 564 S/7 11/68 TivSomoUlMARY ol 

AY2 Dae 1 564 3/11/88 11; 36,SUMMARY, BI 

AY3 ae! 1 564 3/10/68 Tiao SUMMARY BI 

AY4 Dawe! i! 564 3/11/88 11: 36°SUMMARY 6 I 

BB c 2 564 8 3/18/88 14:42 

BC Siete I 564 7/20/66 OP SoeOnEARANCE TYPE 

BD C 2 564 8 3/18/88 14:43 CLEARANCE REV.: DATE 
BE C 2 564 8 3/18/88 14:44 DATE OF SEPERATION 
BF oe 1 564 yi 26, oo 10456 Kono AGE CODE 

BG1l pall 1 564 3/11/88 14:06 MILITARY OFFENSES 
BG. al i 564 3/11/88 14:06 MILITARY OFFENSES 
BG3 jit I 564 3/11/88 14:06 MILITARY OFFENSES 
BG4 Sl 1 564 S/ i788 14: Cosi TARY “OFFENSES 
BH1 5 fal IL 564 3/11/88 14:29 REMARKS /DISCHARGE 
BH2 Sie iL 564 3/11/88 14:30 REMARKS/DISCHARGE 
BH3 See 1 564 3/11/88 14:30 REMARKS /DISCHARGE 
BH4 Se gel 1 564 3/11/88 14:30 REMARKS /DISCHARGE 

BL ATI 1 564 2/26/88 10:08 STATUS OF 5520/20 

BM ayy pil 1 564 2/26/88 10:08 DISCHARGE CASE CODE 
BO 6) cl 1 564 2/26/88 10:08 INTERSVC. SEP. CODE 
BP Ze i 564 2/26/88 10:08 CHARACTER OF SERVICE 
BQ 2M i 564 2/26/88 10:08 TYPE OF DISCHARGE 
Figure 3. List of Variables Contained in the Data Base (Continued): Extracted 


from the Statgraphics Data Management Screen. 


C. DATA REPRESENTATION PROBLEMS 
Inherent in the verification and documentation of a large data base obtained from 


an outside source are coding inconsistencies. Ideally, thorough documentation of the 


codes used and the thought process employed in creating the data base is included with 
it. However, this is seldom the case. 
The PERSEREC data base had many inconsistencies along with several strengths. 
A major strength of the data organization was the standardization of most of the coding 
emploved. Derogatory information codes (used in 43 of the 93 columns) and recom- 
mendation codes (used in 13 of the columns) were used in a fairly standard manner. The 
numeric code for all derogatory information contained in the data base consisted of a 
standard four-digit code representing 135 different infractions. The list of infractions 
and their codes is listed in Appendix B. 
The numeric code used for the types of recommendations obtained from various 
sources consisted of a two-digit integer representing the total number of persons who: 
1. Recommended the subject for a position of trust; 
2. Recommended the subject for a position of trust, with supervision; 
. Did not recommend the subject for a position of trust; 
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4. Declined comment. 


Most sources of derogatory information are represented by several columns in the 
data base. A source is considered a location such as college, high school, employer, 
neighborhood, etc.. Multiple columns are available for each source categorv to allow 
room for several different tvpes of derogatory information to be displayed, if necessary. 
Table 2 shows how the information of columns Y1, Y2, Y3, and Y4 (findings or derog- 


atory information obtained from emplovers) was represented: 


Table 2. INITIAL REPRESENTATION OF DEROGATORY INFORMATION 
(EXAMPLE). 





After research, these records were interpreted in the following manner: If there are 


only 9999 entries in a particular record’s entries in Yl - Y4, then no derogatory 


information from the subject’s former employers was found. The possibility of no in- 
terview being conducted is reasonable, although all information indicates that former 
employers were visited in almost all instances. If any 9999 entries are contained along 
with derogatory information for a particular record, those 9999 codes are meaningless. 


The corrected records are shown in Table 3. 


Table 3. REPRESENTATION OF DEROGATORY INFORMATION AFTER 
REDUCTION (EXAMPLE). 


a 
Sor re 
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In this table no information was obtained on the person represented by record 
number I. For the second person, the investigator found evidence that the person was 
known to lie (1071), and that he was at some time intoxicated in public (1106). The third 
person had evidence of vandalism (1829) and malicious mischief (1844). The fourth 
person was found to have an incident of reckless driving (1805) and also illegal use of a 
firearm (1824). 

Columns representing derogatory information obtained from colleges, high schools, 
neighbors, and other sources were simularly reduced. 

As discussed above, the 9999 code used in columns YI - Y4 represented “no derog- 
atory information.” Research revealed that this interpretation of the 9999 code could 
not be used in some of the other columns. In the security investigation realm, employers 
and neighbors are considered “productive” sources. With that designation, the former 
employers and neighbors of a subject are almost always interviewed, thus the 9999 code 
for those sources means “no derogatory information.” Sources other than employers and 
neighbors, on the other hand, are normally only visited by an investigator when he 1s 
fairly certain to obtain derogatory information. The 9999 code in conjunction with these 
tyvpes of sources means “no interview conducted.” 

An even more confusing coding scheme was discovered relating to the recommen- 


dations obtained from the five tvpes of sources outlined above. For the emplover, high 


school, college and other sources, a 99 code represents “no interview.” The coding for 
neighborhood recommendations was different. 

Neighborhoods are the source of many developed sources of derogatory informa- 
tion. A distinction was made between the recommendations of neighbors listed on the 
SPH (generally positive) and those from neighborhood sources developed bv the inves- 
tigators. This resulted in four possible entries for recommendations from a subject's 
neighborhood. The column vectors representing information obtained from the subject’s 
neighbor are designated AAI-AA4. Column AAI represents the recommendations ob- 
tained from persons listed on a subject’s Statement of Personal History. Entries in col- 
umns AA2-AA4 were recommendations obtained from neighborhood sources developed 
by the investigator. A 99 entry in column AAI meant “no interview conducted,” while 
a 99 entry in column AA2 means “no sources developed.” Furthermore, a 99 entry in 
columns AA3 or AA4 meant nothing. These variable fields were repaired by removing 
all 99 codes from columns AA3 and AA4. 

Another instance of miscoding occurred in column AN, which represents the num- 
ber of siblings of the subject. Throughout the field a character code of “LU” existed along 
with the usual integers ( 1, 2,...) representing the number of siblings. This code was 
thoroughly researched until the only possible explanation was obtained--it represented 
“unknown.” 

The problems highlighted here point to the importance of differentiating, by coding, 
even small differences in meaning when implementing codes. The failure to do so risks 
losing important distinctions which may in fact invalidate the data. Another point to 
be made is that documentation is essential when data bases are created. Luckily, the 
person who performed the data entry was available for reference throughout the data 
reduction stage of this project, otherwise much of the information contained in the data 
base might have been lost. 

Erroneous entries were not commonly found in the data base. Only two erroneous 
codes (not of the 135 actual derogatory information codes) were found and they were in 
the same column. Research into the underlying record revealed that the codes had digits 
transposed and the corrections were easily made. 

Missing values, or blanks, were common in some columns. Care had to be taken 
to preserve these blanks when transferring from one system to another. The Stat- 


graphics representation of blanks as the integer -32768 proved useful in this regard. 


The files were initially represented in a random order by record number. This 
proved inconvenient when cross-validation of the record to its original file was necessary. 
The use of APL in conjunction with Statgraphics allowed all records to be reorganized 
in ascending order and made the file much easier to reference. 

Date fields were entered as six-digit codes representing month-day-year. Problems 
Were encountered with formatting as Statgraphics requires a slash (/) between the month 
and day and the day and year. A simple APL function was written which performed this 


conversion. 


D. RECOMMENDATIONS FOR CODING A LARGE DATA BASE 


1. Care must be taken to differentiate even subtle variations in meaning by using dif- 
feremt codes. 


2. The data base must be designed with the proper analytical tool (software and 
hardware) consistent with the purpose and goals of the analvsis. 


3. Proper documentation is essential when creating a data base. This is important not 
only for the data base creators to have for their own memory, but also so that 
others may use the data base. It is also important because others mav use the data 
long after the creator has finished with it and is available to answer questions. 


4. Design of the data base should be a slow, careful affair. If this stage is neglected, 
the data base designer risks wasting many hours of work and compromising the real 
value of the data base. 

E. RECOMMENDATIONS FOR IMPLEMENTING A LARGE DATA BASE 
Statgraphics has a scrollable data editor which allows the entry, manipulation, and 
review of large data bases. It is convenient, simple to use, and, most importantly, makes 
it easy to correct and manipulate the data when anomalies are detected. 
In view of the value that such a scrollable data editor provided when reducing and 
documenting a data base which 1s already in existence, here are some recommendations 
for data base design. The design should: 


1. Allow for speedy input of and access to new data; 


2. Allow the data to be manipulated and massaged with scrollable full-screen data 
editors; 


3. Allow easy access by statistical graphics packages such as Grafstat. 
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I. DATA ANALYSIS 


A. GENERAL APPROACH FOR THE EXPLORATORY DATA ANALYSIS 

The primary question which this thesis attempts to answer 1s, “What relationships 
exist between the information derived from the subjects’ background investigations and 
the final disposition of their service?” The answers obtained here will not, of course, be 
all inclusive but provide a starting point for further research involving this data base. 
In particular, this is not the only question to be answered from the data base, but as in 
much research, other questions and facts become apparent as the research progresses. 

Inherent in a data analysis 1s the initial investigation into the properties and limita- 
tions of the data. The PERSEREC special background investigation data (SBID) 1s 
primarily categorical in nature. The record for each individual! contains several different 


tvpes of information: 


1. Background and biographical information such as age, marital status, reason for 
investigation, etc.; 


2. Derogatory information (or lack thereof) obtained bv investigators from various 
sources (high school, neighborhood, emplovers, etc.); this information mav consist 
of crimes, Subject admissions, and other matters that reflect on the person’s char- 
acter and judgement; 


3. Recommendations from various people associated with these sources as to whether 
they felt that the individual in question should be trusted with a position of trust 
and responsibility; 


4. The result of the term of military service, whether the individual was discharged 
normally, or due to some adverse circumstances. 


The data can viewed as information obtained prior to the completion of the inves- 
tigation (explanatory variables or independent variables) and information which 1s the 
result of the person’s service after the investigation (response variables or dependent 
variables). 

Note that the data is basically categorical, e.g., male or female, and thus has no 1n- 
herent ordering. Thus, while frequency counts can be obtained and are given in Ap- 
pendix A, no distributional measures, e.g., means or variances, can be computed. 
Similarly, dependencies and associations cannot be measured by moments based upon 
joint distributions, e.g., correlation coefficients. 

The data thus appears to be ideal for contingency table methods [Ref. 3: pp. 153 - 


170]. However, note that the one response variable in the contingency table 1s almost 
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alwavs the character-of-service categories (six of them), and these have fixed marginal 
frequencies of occurence. Thus the type of contingency table analvsis applied is for dif- 
ferences in probabilities (of the occurrences of the free category in each of the fixed ca- 
tegory samples). Thus, one would test that the probability of discharge for each of the 
six categories (good, homosexual, misconduct, drug/alcohol abuse, court martial, 
character; behaviour disorder) is the same for males or females. 

The background, derogatory, and recommendation data is very specific and limited 
to that obtained from each type of source. The information received from any one 
source, say an individual’s high school, does not paint a very complete picture of the 
person, no matter whether that information 1s primarily good or bad. Of more interest 
is the overall characterization of the individual’s past which is represented by the record 
as a Whole. The obvious question is how can many pieces of different information, 1.e., 
different columns in the data base, be combined to give an overall picture of the person’s 
character? 


There are several approaches used in this thesis to answer that question: 


1. To look at particular biographical information which may provide insight into dis- 
charge categories; 


2. To look at the total quantity of derogatory information as a measure relating to the 
discharge categories; 


3. To look at the existence and quantity of particular tvpes of derogatory information 
as a measure relating to the discharge categories, 1.e., evidence of drug use discov- 
ered in the investigation; 


4. To look at the tvpes, amounts, and quality of the recommendations that persons 
Were willing to give about the subject of the investigation. 

The data used for this research was configured as coded APL vectors ordered by 
record number such that all information for record number | was in the first position 
of each variable vector and so forth for all records. The numeric codes represented each 
particular category of the information contained in that type of variable vector. 

It was a relatively straightforward matter to use APL logical operations to collect 
the information needed for each approach discussed above and then to represent this 
information in an APL vector. For example, if the existence of drug use in the records 
was the information desired, each of the 35 columns which represented derogatorv in- 
formation Were operated upon in turn and a 0 or | resulted which represented no drug 
use or drug use in a particular column. As this information was obtained the informa- 


tion was totalled so that in the end a vector representing the total amount of drug use 


Is 


discovered by the investigators was available. The result was an ordered, integer-valued 
APL vector which could be cross-tabulated with the character-of-service vector to de- 
termine if the character-of-service was independent of evidence of prior drug use, e.g., 
the probability of discharge for drug’/alcohol abuse was the same for any level of prior 
drug abuse. With an easy APL logical operation this variable could be quickly trans- 
formed into a binary vector with 0 representing no evidence of drug use and | repres- 
enting at least one incident on record. 

The information, once configured into appropriate independent variable vectors, and 
the response variable vector (generally character-of-service) were cross-tabulated to cre- 
ate a contingency table, generate residuals, and produce the chi-square test statistic al- 
lowing a test of difference of probabilities to be performed. The graphical mainframe 
software package, Grafstat, did not offer a cross-tabulation capabilitv; however, a cus- 
tomized APL cross-tabulation function developed by Luis Uribe and Professor Peter A. 
W. Lewis was used which operated in the Grafstat environment. This package was 
developed to allow, simultaneously, a visual and tabular way of looking at the cross-ta- 
bulation of the categorical data while also providing the needed information to quickly 
perform any of the chi-square contingency tests (the independence test, the difference 
of probability test, or the fixed marginal total test). These tests are conducted in iden- 
tical manners; however, they differ in the manner in which the data 1s sampled and thus 
in the way the test is interpreted. Figure 4 shows an example of the output of the 


cross-tab function. 
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SUBJECT SEX VS. CHARACTER OF SERVICE 
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Figure 4. Sample Cross-Tabulation: The Cross-Tabulation of Sex vs. Character 


of Service. 


The graphical display in Figure 4 is a combination of tabular data representing the 
resultant contingency tables of the cross-tabulation and scaled sunflower plots repres- 
enting the relative counts produced in the table. The sunflower plots are coded in the 
Same position and color (on the color graphics screen) as the related tabular informa- 
tion. The number of arms in the sunflower plot is proportional to the total counts in 
the table. The graph is designed so as to make it easy for a user to quickly locate cells 
with high (or low) incidence. Thus one can quickly see from the graph in Figure 4 that 
the highest incidence is of males having good discharges and that no females in the 
sample were discharged for drug‘alcohol abuse. 

The table below the graph gives the actual count for each cell, the cell’s percentage 


of the particular row marginal total, and the residual produced using the standardized 
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difference between the actual cell count and the expected cell count (see below for defi- 
nition). When color output is available, ‘significantly’ large residuals are displayed in 
rear 

The cross-tabulation is a simple procedure. Two vectors representing two different 
types of information on each person are configured with a numerical code representing 
specific classes within each vector. The vectors are ordered identically so that the in- 
formation in the first position of both vectors represent information about the same 
person (or record) and so forth for all record positions. The cross-tabulation is merely 
a count of the various matchups of categories between the vectors. Figure 4 shows the 
cross-tabulation of the subjects’ sex versus their character-of-service. The variable re- 
presenting sex 1s coded as: 


QO - represents male. 


1 - represents female. 


The variable representing character-of-service 1s likewise coded as: 
Q - represents a discharge for service characterized as good. 
1 - represents a discharge for homosexual-related problems. 
2 - represents a discharge related to misconduct. 
3 - represents a discharge related to drug or alcohol abuse. 
4 - represents a discharge as a result of a court miartial. 


5 - represents a discharge for a character or behaviour disorder. 


The character-of-service categories are the fixed-size categories (columns in 
Figure 4). The sex of the individuals represent the free-size categories (rows in the same 
figure). 

The upper left cell in Figure 4 shows a count of 160 where the row designated 
“male” and the column designated as “good” intersect. This occurred because there were 
160 records where the code 0 representing sex matched up with the code 0 representing 
character-of-service. Each cell in the cross-tabulation was constructed in this same 
manner. 

The cross-tabulation function used in this thesis assisted in the conduct of the chi- 
square test of differences in probability. This test is conducted when separate samples 
are taken from several populations. In this case the several populations are the six po- 


pulations of people with different character-of-service records, these populations having 


been fixed at the beginning of the data collection. There are n, observations possible 
from each separate population, } and N=n,+7,+7, +7, +1, +7, total observations. 
Assumptions of the test are: 


1. Each sample is random. 


2. Outcomes of various samples are independent. 
3. Each observation can be classified into one of m classes. 
First here is a definition of terms to be used. Let, 


0, = 


=) 7y 


the cell count in the cell formed from row i and column (mites O55. 10 
Figure 4, 1s 59, the number of males (1 


1) with homosexual ‘character-of-service’ (} 


r= 


the marginal total of counts which fall in the category represented by category 


1, Which represents the total of all entries from row 1. Thus, in Figure 4, r, 1s the total 
number of males and r, is the total number of females. 


r 


n, = the marginal total of counts which fall in the category represented by category 
}, Which represents the size of the sample from the j” population 


N=), 


_ 


Ce 


; the total number of records. 


..., mt, the index of the random row categories. 


move index of the fixed columm categories. 


Py = 


the probability that an individual possesses classification 1, given that thev 
are a member of sample j. 


In the example of Figure 4 this is the probability that a 
person is male or female (classification 1) given that the person is from a particular 
character-of-service category. 


The hypotheses of the chi-square test of differences in probabilities are (if the col- 
umn sums are the fixed categories): 
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H, : All probabilities in the same row are equal to each other (p,, = pz =°" = p,, ) for all i. 


H,: : At least two probabilities are not equal to one another (p, 4 p, for some i and some 
pair j and k). [Ref. 3: p. 154]. 


In other words, in Figure 4 , this hypothesis suggests that the probability that a 
person in the good character-of-service category is male is equal to the probability that 
a person of the misconduct character-of-service categorv is male and so forth for the 
other categories. 


Alternatively, the null hypothesis, H,, is defined as: 


H,: Pr [ an observation falls in cell i,j ] 
= Pr{ an observation falls in rowi ] x Pr [an observation falls in column j] 
for all i, j. 


The expected count in each cell, under the null hypothesis of equal probability, can 


be defined as follows: 





The standardized difference between the actual cell count and the expected cell 


count 1s called the signed-residual and 1s calculated in the following manner: 


OS a 


——— 


y i 
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The test statistic, Q, is determined by summing the squared residuals: 


m n : 
O = (0; —_ i) 
ey 

i=] Jj=1 


Q is, for large enough cell entries, distributed as a chi-square statistic with 


(m— 1) x (n— 1) degrees of freedom. The chi-square statistic Q, it’s degrees of freedom, 
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and the 95th quantile of the matching chi-squared distribution are listed in Figure 4 to 
the left of the graph to aid in evaluating the hypotheses. 

The manner in which the cross-tabulated information was displayed allowed the 
chi-square test of independence to be performed in a relatively simple manner. The test 
required only a quick comparison of the test statistic and the 95th quantile to evaluate 
the null hypothesis that the probabilities in each of the same row were equal. 

There remained an evaluation of the expected cell sizes for appropriate size for the 
chi-square distribution of the test statistic to be valid and for the test to be meaningful. 
The generally accepted rule that the expected value of each cell, e, should be larger than 
5.0 was followed, and if this were not so the number of categories should be reduced by 
combining like categories. There seems to be some support for an argument that, given 
large enough samples, an expected cell size even as small as one is acceptable. However, 
the purpose of this thesis 1s not to argue this point, and therefore the rule was followed. 
fiers : p. 156} 

At this point in the analysis, if expected cell sizes were too small, hke rows were 
combined and the cross-tabulation was reiterated, or, if expected cell sizes were large 


enough, the cross-tabulation was accepted as performed. 


B. RESIDUAL ANALYSIS 

The chi-square test of independence conducted with a contingency table is verv 
general and does not say anything specific relating to the likely dependencies of partic- 
ular categories within the variable vector. 

For example, incidents of prior drug use, cross-tabulated with character-of-service 
may indicate a lack of independence; however, it is difficult to say, specifically, that lack 
of prior drug use indicates a higher than expected probability of good service, or that 
high levels of prior drug use indicates a lower than expected probability of good service, 
or both, even when the chi-square test of differences in probabilities indicates that the 
probabilities that a member of a fixed category is classified in the each of the free cate- 
gories is not equal. The problem becomes more severe with increase in the number of 
categories, though for a 2 by 2 table there are clear constraints because a large number 
in one cell forces the other cell to be small. Thus an empirical method of determining 
components of dependence information 1s desired. 

The residuals produced within each cell represent a rough way of determining such 
dependency: a higher magnitude of the residual seems to indicate a larger discrepancy 


from the expected value of the cell size given independence. If the sample is large 
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enough, the signed residual is approximately normally distributed. Thus we can score 
all residuals and select those whose values are greater than, sav, the 95th quantile of the 
standard normal distribution. However, we cannot make this assuniption when dealing 
with more than one residual produced from two dependent variables. It is difficult to 
say much about the distribution of the maximum residual produced from two variable 
vectors even under the null hypothesis of independence. 

The basis of the analysis presented in this thesis is to look at just the sort of de- 
pendency mentioned above and some way was sought to analyze the residuals to provide 
an acceptable basis for asserting that the particular dependencies between variables 
existed. 

Residual size appeared to be the key to determining the particular dependencies 
within variables which have been shown to not be independent. Ideally, all residuals 
would be distributed according to some known distribution, and a residual with an un- 
likely value under this distribution would indicate a particular dependency. There is no 
basis for such an ideal situation. A technique which 1s available, however, can provide 
an empirical method to give insight into this distribution. This method 1s known as the 
bootstrap-simulation technique. 

The premuse behind the bootstrapping technique is that there exist many situations 
where it 1s desirable to have distributional information about random variables where 
such information is unknown or difficult to generalize. In cases where such information 
1s unknown but the distribution of the underlying basis for the random process is known, 
then to simulate the process and sample the random variables created allows an emprr- 
ical analysis to be conducted, especially of the quantiles of the unknown distribution. 

The basis for the bootstrapping used in this thesis is that independent variables 
produce counts in the contingency tables according to the null hvpothesis explained in 
the previous section. That is, the probability that a random variable belongs in a par- 
ticular cell (1,}) is equal to the probability that it belongs in the particular row category 
1 times the probability that it belongs in the particular column category j. Taking this 
hypothesis a step further, the probability that a count belongs in a particular cell of a 
contingency table formed by two independent variables can be estimated by dividing 
each cell’s expected value by the total counts contained in the table. The total of all the 
estimated probabilities sum to one and the distribution of counts within the table are 


essentially multinomial with n total counts and the probabilities of each cell. 
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With this in mind, a monte-carlo simulation of any particular contingency table 
(with particular marginal counts) can be performed using multinomial random numbers. 
The distribution of the largest absolute residual can be empirically generated with a large 
number of replications. This allows the comparison of the 95th quantile residual gen- 
erated by this simulation of independent variables versus the residuals produced in an 
actual contingency table, where the hypothesis of independence has been rejected. 

In effect, the comparison of the residuals against the 95th quantile bootstrap resi- 
dual is a test of the null hypothesis: 


H,: The residual does not indicate a direct relationship between the two factors which join 


to form a cell. 


If the magnitude of a particular residual is larger than the bootstrap residual, then 
we reject this H, at a level of significance of 0.05. 

For this thesis a FORTRAN program was written which used as input the actual 
marginal totals produced in each cross- tabulation performed. These marginal totals 
were used to generate the expected values for each cell under the null hypothesis that the 
variables were independent. Probabilities for each cell were computed. The IMSL mul- 
tinomial random number generator was used to generate counts based upon these 
probabilities and the total count for the table. The simulation was replicated 200 times 
for each table. Residuals were generated for each cell in the exact same manner as used 
in the cross-tabulation function for each replication. The largest absolute residual for 
each replication was determined and saved. An empirical distribution of the largest re- 
siduals of a particular m by n contingency table with particular marginal totals was 
available. The statistics of interest (in particular,upper quantiles) were available for 
comparison with the residuals produced by the cross- tabulation of the actual variable 
vectors. Appendix C contains the FORTRAN program, Appendix D contains the input 
file, and Appendix E contains the output file. 

Comparison of the actual residuals with the 95th quantile largest residual produced 
in the simulation allowed a determination of the comparative size of residuals and allows 


some insight into particular dependencies within each contingency table. 


C. APPLICATION TO THE ACTUAL DATA 
There were two major types of independent variables available from the data: de- 


rogatorv information consisting of crimes and negative items of information uncovered 
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by investigators, and recommendations from friends, family members and acquaintances 
of the subject. 

There were two primary response variables available in the data base: character- 
of-service and type of discharge. The character-of-service represents the manner in 
which the subject performed during his period of service. If the individual completed 
his enlistment without incident, his service is characterized as good, otherwise the indi- 
vidual has an adverse entry in his file and was released from service early. The categories 
of service are: 

1. Good. 
2. Homosexual activity or inclination. 
3, Misconduct. 
4. Drug or alcohol abuse. 


Court martial. 


Ca 


6. Character or behavior disorder. 


Type of discharge (honorable, general, and other than honorable) is highly depend- 
ent upon the character-of-service. A subject with a negative character-of-service cate- 
gory sometimes receives an honorable discharge, while a subject with a good character 
of service never receives anything but an honorable discharge. For this reason, the 
character-of-service was used in this thesis as the response variable in all cases. In fact, 
because the records were selected based upon the character of service categories, the 


character-of-service would appear to be the onlv reasonable response variable. 


D. ANALYSIS OF DEROGATORY INFORMATION 
1. General 
Derogatory information uncovered by the investigators came from the high 
schools, colleges, emplovers, neighbors, national agencies (FBI and police departments), 
local agencies, credit bureaus and other sources (‘other sources’ represents sources of 
information exclusive of the other, explicitly-stated sources). The code or codes repres- 
enting derogatory information obtained from each of the source categories for each in- 
vestigation subject was listed in the data base in the appropriate column variable vector. 
2. Tests for Differences in Probability Involving Derogatory Information 
The derogatory information of interest was grouped and collated into a new, 


coded variable vector and cross-tabulated with the character-of-service. 
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Groupings were determined to confirm relationships which should exist (record of prior 
drug use nught indicate a higher than normal result of drug and alcohol character-of- 
service, for example) and also to answer some specific questions PERSEREC was inter- 
ested in investigating. 

Table 4 summarizes the results of the cross-tabulations and tests for independ- 
ence concerning derogatory information (the response variable in all cases is the char- 


acter-of-service), which are shown in later parts of this section. 


Table 4. SUMMARIZED RESULTS OF CROSS-TABULATIONS: Involving 


derogatory and other information. 
Independent Variable Chi-Square | Degrees of | .95 Level of | Depend- 
Statistic Freedom Significance ee 


n 
Sex (male female) Ro. “aa LOF ay 
High School Diploma 18.07 ee 

eee es te 18.31 


Beeseators Info. Dis- 29.81 11.07 yes 
closed 


Incident of aie I Crime 30; m—not Ss. Titer” | oa | 
/Adjustment Incidents [17.07 | SL OT] ves 


Category of Most Serious | 60.46 10 18.31 ves 
Drug Use 











AS vou can see, all of the cross-tabulations listed above indicate a difference in 
structure between character-of-service categories except for the age at enlistment-char- 
acter-of-service comparison. 

3. Residual Analysis Involving Derogatory Information 

Residual analvsis of the cross-tabulations is explained in the following tables. 
Negative residual values indicate that the actual cell count was lower than the expected 
value. This indicates a negative relationship. The opposite holds for positive residuals. 

As displaved in Table 5 , the incidence of court martials is significantly low 


among women (the second largest residual in absolute value, -3.02), but character and 


an 


behavior disorders are significantly more prevalent among women (largest residual in 


absolute value). 


SUBJECT SEX VS. CHARACTER OF SERVICE 
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49.60 
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Figure 5. The Cross-Tabulation of Sex vs. Character-of-Service: This is a another 


copy of Figure 4. 
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Table 5. RESIDUAL ANALYSIS OF SEX VS: CHARACTER- 
OFFSERVICE: For Figure 5 
Independent Variable 95 Quan- Significant Categorical Residual 
tile Boot- Relationship Value 


strap 
Residual 


Value 


Female - Court Martial 


Female - Character Be- 3.58 
havior 





Table 6 shows that those individuals receiving court martials and being dis- 


charged have a significantly higher incidence of being non-graduates of high school. 


HIGH SCHOOL DIPLOMA BEARER VS. CHARACTER OF SERVICE 
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Figure 6. The Cross-Tabulation of High School vs. Character-of-Service. 
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Table 6. RESIDUAL ANALYSIS OF HIGH SCHOOL VS. CHARACTER- OF- 
SERVICE: For Figure 6 


Independent Variable 95 Quan- Significant Categorical Residual 
tile Boot- Relationship Value 
strap 
Residual 


Value 


High School Diploma 2.67 Nongraduate - Court 2.85 
Martal 





AS intuition, and Table 7 suggests, those individuals who have no derogatory 
information disclosed in their investigation were significantly more likely to have suc- 


cessfully completed their service {received a good character-of-service designation). 
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DEROGATORY INFORMATION DISCLOSED IN INVESTIGATION VS. CHARACTER OF SERVICE 
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Figure 7. The Cross-Tabulation of Derogatory vs. Character-of-Service. 


Table 7. RESIDUAL ANALYSIS OF DEROGATORY VS. CHARACTER- OF- 
SERVICE: For Figure 7 


Independent Variable 95 Quan- Significant Categorical Residual 
tile Boot- Relationship Value 
strap 
Residual 


Value 


Derogatory Information eek No Information - Good 3.66 
Disclosed 





Py) 


Looking at Figure 8 we see that persons who had a record of major crime (fel- 
ony-related derogatory information) Were more likely to be discharged for courts martial 


while persons with a major crime on record were less likely to be discharged with a good 


character-of-service. 


SUBJECTS WITH AN INCIDENT OF MAJOR CRIME DISCLOSED IN INV. VS. CHARACTER OF SVC. 
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Figure 8. The Cross-Tabulation of Major Crime vs. Character-of-Service. 
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Table 8. RESIDUAL ANALYSIS OF MAJOR CRIME VS. CHARACTER- OF- 
SERVICE: For Figure 8 on page 31 


Independent Variable 0.95 Quan- | Significant Categorical Residual 
tile Boot- Relationship Value 
strap 
Residual 


Value 


Major Crime Disclosed in On Record - Good 


the Investigation On Record - Court Mar- 
tial 





Table 9, Figure 9, Figure 10, and Figure 11 all refer to drug abuse. Figure 9 
considers the incidence of drug use alone. Figure 10 is concerned with substance abuse 
in general by including alcohol-related incidents along with drug use. Figure 11 looks 
at drug use by categories of seriousness, from none, to marijuana only, to any involve- 
ment of a more serious nature. Evidence of drug’alcohol abuse prior to the investigation 
is highly associated with the drug‘alcohol character-of-service (Figure 10). Records 
with no evidence of prior use were more hkely to result in the good character-of-service. 
Combining the prior alcohol incidents did not significantly alter this relationship 
(Figure 11). There was a strong relationship between those court martialed and a record 
of hard drug use (drugs stronger than marijuana), shown in Figure 11. Prior evidence 


of marijuana use is less likely to result in a good character-of-service (Figure 11). 


jul 


DRUG USE DISCLOSED IN INVESTIGATION VS. CHARACTER OF SERVICE 
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Figure 9. The Cross-Tabulation of Drug Abuse vs. Character-of-Service. 


DRUG/ALCOHOL ABUSE DISCLOSED IN INVESTIGATION VS. CHARACTER OF SERVICE 
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Figure 10. The Cross-Tabulation of Drug/Alc. Abuse vs. Character-of-Service. 
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CATEGORY OF MOST SERIOUS DRUG USE DISCLOSED VS. CHARACTER OF SERVICE 
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Figure 11. The Cross-Tabulation of Most Serious Drug Abuse vs. Char.-of-Sve. 
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Table 9. RESIDUAL ANALYSIS OF DRUG USE VS. CHARACTER-OF- SER- 
NaCr 


Independent Variable 0.95 Quan- | Significant Categorical Residual 
tile Boot- Relationship Value 
Strap 
Residual 
Value 


Drug Use Disclosed (Fig- }DrugUse-Good Use - }DrugUse-Good 


Urey 9) 


Drug Use - Drug’Alcohol 
Drug/Alc - Good -3.59 
Drug/Alc - Drug/Alc 


Category of Most Serious =— Maryuana - Good -3.10 


Drug/Alcohol Abuse Dis- 
closed (Figure 10) 


Drug Use (Figure 11) 





Hard Drugs - Court Mar- 
tial 


The cross-tabulation of age versus character-of-service shown in Figure 12 1n- 


dicated no separate association between the various character of service categories. 
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AGE AT ENUSTMENT VS. CHARACTER OF SERVICE 
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Figure 12. The Cross-Tabulations of Age Versus Character-of-Service 


4. General Comments About the Derogatory Information Cross-Tabulation 

There are some general comments which follow from the analysis of the derog- 
atory information cross-tabulations displayed in Figures 5 through 12. Significantly 
lower than expected amounts of derogatory information were associated with the good 
character-of-service category in almost any cross-tabulation performed. The residual 
value was not always large enough to exceed the 95th quantile bootstrap residual; how- 
ever, It 1s significant that persons with relatively low amounts of derogatorv information 
are almost always associated with the good character-of-service. Some amounts of cer- 
tain types of information are more strongly associated with some categories than others. 
But some “good” people had “bad” information in their files and some “bad” people had 
no “bad” information in their file. All that can really be inferred is trends and tendencies, 


not predictions. There does appear to be plenty of evidence to indicate caution should 
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be exercised when investigations indicate serious derogatory information. The appear- 
ance of derogatory information does not guarantee a problem service member, however, 
the government is indeed taking a higher risk that the service of such a person will ter- 
minate early. The mere existence of derogatory information does not indicate that a 
service member will leave service early (roughly 40% of subjects with good character- 
of-service had at least one item revealed in their investigation); but, of those that left 


Service With an adverse discharge, 80% had at least one item of derogatory information. 


E. ANALYSIS OF RECOMMENDATION DATA 
The investigators interview persons who know the subject of an investigation and 
ask them: 
1. Would you recommend this person for a position of trust? 
2. Or, would you recommend that person for a position of trust with supervision? 
3. Or, would you not recommend them? 
4 


Or, do you decline to comment? 


This information 1s used, along with everything else, to determine if the person gets 
the security clearance. Questions that came to mind when the data was investigated 


were. 


]. “What do positive and negative recommendations tell us 1n relation to the subse- 
quent character-of-service? 


2. Can they tell us something separately? 


3. Can they be combined, by some scoring system, to allow us to show some re- 
lationship between the recommendations received (both positive and negative) and 
the subsequent character-of-service?” 


The incidence of negative recommendations 1s relatively rare in this data base (only 
73 persons had at least one “not recommended”, and few of them had more than one). 
The recommendation information was cross-tabulated against the character-of-ser- 


vice variable in several ways: 


1. A total of the number of recommendations each individual received was made (case 
he). 


2. A total of the number of “not recommended” each individual received was made 
(case 2). 


3. A recommendation score was constructed with a recommendation having a +1 
value and all other recommendation categories having a -1 value (case 3), called the 
adjusted recommendation score. 


by 


4. A recommendation score was constructed with a recommendation having a +1 
value, recommendation with supervision and decline comment having a -1 value, 
and not recommended having a -3 value (case 4), called the weighted recommen- 
dation score. 


Figure 13, Figure 14, Figure 15, and Figure 16 below show the cross-tabulations of the 
four recommendation cases discussed in the previous paragraph versus the character- 
of-service variables. 

The cross-tabulations of the four recommendation scoring methods with the 
character-of-service are summarized in Table 10. 

Table 10 shows, for all cases, that the hypothesis of equal probabilities for the 
recommendation levels between the character-of-service categories should be strongly 


rejected: 


NUMBER OF RECOMMENDATIONS DISCLOSED IN INVESTIGATION VS. CHARACTER—OF—SERVICE 
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Figure 13. The Cross-Tabulation of Recommendations vs. Character-of-Service 
(Case 1). 


38 


NUMBER OF NOT RECOMMENDED DISCLOSED IN INVESTIGATION VS. CHARACTER OF SERVICE 
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Figure 14. The Cross-Tabulation of Not Recommended vs. Character-of-Service 
(Case 2). 
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ADJUSTED RECOMMENDATION SCORE VS. CHARACTER—OF—SERVICE 
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Figure 15. The Cross-Tabulation of Adjusted Recs. vs. Character-of-Service (Case 
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WEIGHTED RECOMMENDATION SCORE VS. CHARACTER-OF—-SERVICE 
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The Cross-Tabulation of Wtd. Rec. Score vs. Character-of-Service (Case 


4). 


Figure 16. 
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Table 10. RESULTS OF CROSS-TABULATIONS INVOLVING RECOMMEN- 
DATIONS: | for figures 13 through 16. 


Independent Variable Chi-Square | Degrees of | .95 Level of | Depend- 
Statistic ae Significance | ency? 
Total Recommended $2.89 25.00 yes 
(Case 1) for Fictites!: 
Total Not Recommended | 17.55 5 11.07 ves 
(Case 2), for Figure 14 
Recommendation Score 84.55 15 25.00 yes 
(Gase 3), forseicure 15 
Weighted Recommenda- 90.40 15 25.00 ves 
tion Score (Case 4), for 
Fieure 16 


It is interesting to note that the inclusion of the negative recommendation data does 







not strongly change the result of the test (see Figure 13, Figure 14, Figure 15, and 
Figure 16), although the chi-square test statistic does increase as negative recommen- 
dations are given greater weights. It seems logical that the negative recommendations 
should be included, because they affected approximately 15% of the scores. Including 
them is also more encompassing and should give a truer overall characterization. 

Analysis of the residuals produced by the recommendation score cross-tabulations 
with their bootstrap residual values showed several significant relationships listed in 
abicw le 
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Table 11. RESIDUAL ANALYSIS OF RECOMMENDATION VS. CHARAC- 
TER-OF-SERVICE 


Independent Variable 0.95 Quan- | Significant Categorical Residual 
tile Boot- Relationship Value 
strap 
Residual 
Value 


Number of Recommen- No Recommendations - -4.54 
dations (Case 1) Good 


No Recommendations - 
Misconduct 
Recommendation Score Score < 0 - Good 


(Case 3) Score < 0 - Misconduct 
Score < 0 - Drug Alcohol 


Weighted Recommedation | 3.04 Score < 0 - Good 
Score (Case 4) 





The residual analysis shows clearly that having at least one recommendation (case 
1) is highly associated with the good character-of-service category (of 205 persons, onlv 
one had no recommendations at all). Similarly, when negative recommendations are 
added in (cases 3 and 4), a low number of negative recommendations is highlv associated 
with this good category (only 15 out of 205 had any not recommended). This is shown 
by the residual analysis to be strongly significant. 

Low recommendation scores were strongly associated with the misconduct and 
drug‘alcohol abuse character-of-service categories. Low recommendation scores were 
not similarly associated with the other three adverse character-of-service categories (ho- 
mosexual, court martial, or behavior disorder). 

Note that in the total of recommendations ( Figure 13), the homosexual charac- 
ter-of-service has a very large residual (3.09) associated with large numbers of recomm- 
endations. This is only slightly lower than the 95th quantile bootstrap residual value of 
3.18, and as the third largest residual in the table, 1s probably significant. A large num- 
ber of recommendations (8 or more) appears to be associated with this character-of- 
service. The high number of recommendations that those discharged with this 


character-of-service receive is very different from all other adverse categories. 
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F. FURTHER TESTING 

The results shown above certainly indicate that a relationship exists between the 
amount and type of derogatory information and recommendations contained in the in- 
vestigation records and the subsequent character of service category. More detailed 
analysis of other tvpes of derogatory information is called for and was not accomplished 


because of time constraints. 


a4 


IV. FURTHER ANALYSIS 


A. GENERAL 

The data analvsis conducted to this point of the thesis dealt with the cross-tablation 
of categorical data, the chi-square test of differences of probabilities, and analysis of the 
residuals. There are many other ways to look at this data, some of which was briefly 


investigated and which warrant further investigation. 


B. ANALYSIS OF PRODUCTIVITY OF SOURCES 

Security background investigations are expensive operations. They involve many 
man-hours of investigators’ time and much of the government’s money to conduct. The 
efficient conduct of investigations is desireable not only to save money, but to avoida 
large backload of investigations and to quickly award or reject the clearances; persons 
awaiting the clearance procedure cannot perform the Job to which they are assigned until 
the clearance is granted. Is the quality and quantity of information obtained from all 
sources equal? Is it reasonable to spend equal amounts with each source of information? 

There are several ways available to provide insight toward the answer to these 
questions. One method is to look at the total amount of information available from 
each source and to compare the amounts. Using APL logical operations, a total amount 


of information was obtained in three different categories: 


1. The amount of derogatory information obtained from each source totalled across 
all records. 


tJ 


The number of recommendations obtained from each source totalled across all 
records; 


3. The number of “not recommended” obtained from each source category totalled 
across all records. 


These totals were graphically displayed using Grafstat’s bar chart capability and the 
results are listed in Figure 17. 

The factor which was immediately apparent to me was that neighbors appear to be 
much more reticent about discussing negative factors during a background investigation 
than persons from other source categories; or, are much more likely to savy nice things 
about their neighbors and friends than are persons associated with the other source 


categories. In other words, based upon the bar charts, it may not be effective to 
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Figure 17. Productivity of Source Categories Across the Entire Data Base. 


investigate subjects’ neighbors in depth unless other areas of the investigation suggest 
that more detailed information might be obtained from neighbors. 

Another factor apparent from the investigation 1s that little information 1s obtained 
from the college source category and that it may not be cost effective to expend re- 
sources at the college of a subject unless other portions of the investigation suggest that 
information can be obtained there. 

Some important facts must be considered here before drawing specific conclusions 
from this figure. The data does not represent a random sample from the population at 
large, but a random sample from each discharge category according to an arbitrarily 
determined proportion. Each discharge category should be investigated separately to see 
if the trend apparent through the data base applies to all discharge categories in the 


same way. 
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A much more detailed mathematical investigation of this trend is needed to confirm 
the validity of this approach. The facts discussed above are strongly suggested by the 
figure, however, further analvsis is needed in order to confirm or draw conclusions from 


them. 


C. ANALYSIS OF THE WEIGHTED RECOMMENDATION SCORE 

Chapter III, part E, discussed several techniques for evaluating the recommendation 
data disclosed as part of the investigation. Several scoring systems were discussed and 
detailed as cases 1, 3, and 4. Case 1 gave a direct score, with each recommendation 
worth one point, and all negative recommendations ignored. Case 3 was similar to case 
1 but included all negative recommendations (recommendation with supervision, not 
recommended, and decline comment) scored as -] and added to the score of case 1. Case 
4, similarly awarded a score of +1 for all positive recommendations, a score of -1 for the 
two “weak” negative recommendations (decline comment and recommend with super- 
vision) and a -3 for strong negative recommendations. 

Case 4 appears to be the better method of scoring for several reasons. It awards a 
higher penalty for the negative recommendation. This higher penalty appears to be 
justified when the relative rarity of negative recommendations 1s considered. Discussion 
with analysts from PERSEREC confirmed this approach as valid, given their experience. 
The highest chi-square statistic was also obtained in the test of this scoring method 
against the character of service. 

The scoring technique provided a transformation from the strict categorical struc- 
ture of the original data into a numerical range of values. This allows us to investigate 
the possible distribution of these scores. 

The distribution of scores across all of the discharge categories together does not 
provide any particular insight into the meaning of the scores. However, the distribution 
of scores Within each distribution category might show us some particular differences. 

Grafstat, the mainframe computer statistical graphics package, contains a three-di- 
mensional empirical densitv plot capability. The separate empirical density of the re- 
commendation scores of each character-of-service category 1s displayed in a 
three-dimensional manner. This allowed rough comparisons between each empirical 
density to be made. Figure 18 displays the three-dimensional empirical density for the 


recommendation scores, with each “slice” representing a character of service category. 
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THREE-DIMENSIONAL EMPIRICAL DENSITY PLOT OF THE 
WEIGHTED RECOMMENDATION SCORES BY CHARACTER 
OF SERVICE 
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Figure 18.  3-D Empirical Density of the Weighted Recommendation Scores 


The empirical density is a relatively simple idea. Each value within a sample (1n this 
case the values are the recommendation scores) is thought to possess a density of = 
where n represents the number of samples. Therefore the total density of the each em- 
pirical density plot represents — = 1.0. The density of each sample 1s distributed around 
the score according to a particular smoothing function. The smoothing function can be 
a uniform distribution (Known as the boxcar function) or according to some other 
scheme (a smoothing function based upon the cosine function is commonly used). The 
density of each sample is calculated, and then the density at each location along the x- 
axis 1s calculated. One could say that the density from each sample is spread around 
each point for some distance. The total densitv at any particular point is the sum of any 


densities Which overlap at that point. The empirical density plot allows us to get a rough 
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idea of how the sample is grouped about particular values and may assist in determining 
the distribution of the sample. 

The three-dimensional empirical density plot of the recommendation scores also al- 
lows us to do something very practical in relation to the recommendation score. 
Figure 19 is the two-dimensional representation of the three-dimensional empirical 
density plot. Each ‘slice’ is laid flat in the two-dimensional plane. This representation 
allows us to interpret the x-axis values more precisely. What was immediately apparent 
was that the recommendation scores associated with the good character of service Were 
closely grouped about a score of +6. There are very few negative values at all. The 
densities of the adverse character of service categories are much more spread out and 
have significantly more densitv in the negative score range. Note particularly the num- 
ber of enlisted men who had very negative weighted recommendation scores and who 


were subsequently discharged for misconduct. 
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EMPIRICAL DENSITY OF WID RECOMMENDATION SCORE 
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Figure 19. Empirical Density Plot of Weighted Rec. Score by Category 


A quick visual analysis of the empirical density plots suggests that the recommen- 
dation scoring technique could assist the adjudicator in evaluating a investigation record. 
If all persons with a recommendation score of less than zero were denied clearance, we 
could be reasonably assured that we would reject few persons who would complete their 
term of service successfully; however, we could also be reasonably assured to reject a 
significant number of people who would later be discharged for adverse reasons. Of 
course, this consideration must be weighted against the high proportion of the popu- 
lation who receive the good character-of-service versus the relatively low number who 
receive adverse characters-of-service. Two percent of the good category probably con- 


sists of many more people than 44% of the misconduct category. Table 12 displays the 
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percentage of each character-of-service category who had a non-positive weighted re- 
conimendation score. The percentage of the genera] population who receive each of the 


six Character-of-service discharge categories is shown for comparison.] 


Table 12. PERCENTAGE OF NON-POSITTIVE RECOMMENDATION 
SCORES 


Character-of-Service Dis- Number in’ | Number Percentage | Percentage 
charge Category Sample. With With Who Re- 
Score < 0 Score < 0 ceive This 
Category in 
General 
Population 


v0.86 


18.57% [0.99% 
61% [12% 
Baie Alcohol Abuse na 31.39% 1.8% 


Court Martial 24.65% 0.18% 


Character and Behavior 13 20.00% 0.65°o 
Disorder 





Further analysis of the recommendation scoring methods is clearly indicated. Re- 
commendation data makes up a large proportion of this data base. Comparisons be- 
tween recommendations and derogatory information may also yield important 
information. It is important to note that the vast majority of persons who are investi- 
gated for a security clearance finish their service successfully and the adverse discharges 
are not common. The proportion of records in each character-of-service category is not 


in any relation to their appearance in the actual population. 


1 For all consideration in this thesis, the “general population” consists of those who have se- 
cunty investigations conducted. 
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V. CONCLUSIONS AND RECOMMENDATIONS 


A. FINDINGS 
1. Cross Tabulation and Residual Analysis 
a. General Findings 

There 1s clear indication that the distribution of investigative data is 
different for each character-of-service category, especially for each adverse category 
when compared to the good character-of-service. 

The good character-of-service category is generally described by sparse 
derogatory information, verv few negative recommendations and a relatively high 
recommendation score. In fact, persons receiving this characterization of service usually 
had several or more recommendations. 

The misconduct, drug'alcohol abuse and court-martial character of service 
category generally exhibit an opposite trend in regard to derogatory information and 
recommendations. These categories tend to have significantly more derogatory 
information and significantly lower recommendation scores. 

The homosexual and character and behavior disorder categories exhibit the 
general trend of the other adverse character-of-service categories, however, their values 
tend to be much less pronounced. It is more difficult to generalize their attributes with 
those of the other categories. 

b. Specific Findings 

Age of the investigation subject bears no particular relation to anv 
character-of-service category. 

Female service-members comprise a significantly lower percentage of the 
drug‘acohol abuse and court-martial character-of-service categories; however, women 
display a significantly higher percentage of the character and behavior disorder category. 

The lack of a high school diploma is significantly associated with the court 
martial character-of-service, but is not significantly associated with any of the other 
categories. 

The lack of any derogatory information 1s significantly associated with only 
the good character-of-service category, however, 40% of persons in this category had 


at least one item of derogatory information in their file. The number of people with at 


‘f1 
tJ 


least one item of derogatory information in the adverse categories tended to be around 
80%. 

Drug abuse disclosed in the investigation 1s significantly associated with the 
drug/alcohol character-of-service category. Nearly 50% of those whose service was ter- 
minated because of drug abuse had evidence of drug abuse disclosed in their investi- 
gations. The misconduct category showed similar, and only slightlv less significant, 
association. Persons who completed service with a good character-of-service evidence 
significantly lower evidence of drug use in their investigations. It is apparent that drug 
abuse disclosed in the investigation should be carefully weighed and could prove to be 
a major discriminator in the adjudication of a clearance. 

The proportion of recommendations varied significantly among all the 
character-of-service categories. In fact, a remarkable attribute of those who received a 
good character-of-service is that only one out of 205 in the sample received no rec- 
ommendations at all. 

The recommendation scoring technique appears to be a valuable method 
of generalizing the recommendations received. This scoring system is better than look- 
ing at positive or negative recommendations alone because it involves more of the data 
and provides a means of balancing between the two extremes. The cross-tabulation of 
the recommendation scoring showed that a low score is not a characteristic of the good 
character-of-service. 

2. Productivity of Sources 
A brief look at the amount of derogatory information, number of recommen- 
dations, and number of negative recommendations shows that the neighborhood and 
college sources mav not be the best place to expend limited investigative resources 
compared with high schools, employers, and the other developed sources. 
3. Weighted Recommendation Score as a Predictor 
The brief look at the three-dimensional empirical density plot of the weighted 
recommendation scores indicates that this scoring method may prove valuable when 
adjudicating investigations. Based upon this data, rejecting any subject with a score less 
than 0 would have a negligible effect upon the persons who later receive a good char- 
acter-of-service while eliminating a significant number of persons who would later have 


received an adverse discharge. 


B. RECOMMENDATIONS FOR FURTHER STUDY INVOLVING THIS DATA 

The actual data analvsis conducted in this thesis was of those factors which ap- 
peared obvious. There 1s much more analvsis which can and should be done. The data 
is now configured for ease of use by someone with an elementary knowlege of APL and 
Grafstat. 


The recommendations are: 


1. Continued research should be conducted into the relationships between the char- 
acter-of-service categories and particular groupings of investigation data. 


2. Further investigation 1s warranted into various productivity of source issues hinted 
at during this investigation. Investigations into productivity of source by infor- 
mation type, and how the sources relate to the various character-of-service cate- 
gories would appear to be fertile areas available for analysis. 


3. The recommendation scoring techniques should be further analyzed to find an op- 
timal score value for each tvpe of recommendation. It seems reasonable that this 
scoring system could be very useful as another tool available to the investigation 
adjudicator when evaluating investigation records. 

C. RECOMMENDATIONS FOR FURTHER STUDIES 

The work accomplished in this thesis has shown that considerable manipulation of 
a large data base can be accomplished using APL. With proper design, and the newer 
versions of APL (APL2) and faster mainframes and microcomputers even larger data 
sets can be efficiently manipulated and analvzed. 

A study involving a large sample from the investigative population at large should 
be conducted. Investigations similar to what has been conducted here along with anal- 


ysis of attributes of the general population should be conducted. 


APPENDIX A. FREQUENCY TABULATIONS OF EACH VARIABLE 


This appendix contains frequency tabulations of each categorical (numerically 
coded) variable. Some variables are given alpha-numeric names (for example AA1), 
because that particular source provided multiple pieces of information for some records. 
These tabulations were performed using the Codebook Procedure of Statgraphics (ver- 
sion 2.6). The tabulations headings were edited to make them easier to understand. 


Labels are limited to 10 characters within Statgraphics. 


Variable C, Sex 


Sex Frequency Codes 
Male 479 1 
Female 85 2 


Variable I, Reason for Background Investigation 


Reason Frequency Codes 
Acc Class 315 1 
Nuc Wpn Hi2 2 
SCI 5 3 
Pres Spt io 2 4 


Variable J, Navy Occupation Code 


Occupation Frequency Codes 
BE/E 6 1 
CTA 2 ea 
Colgll ye 1 
CTM 13 Ls: 
CTO 15 14 
CTR 5 15 
Cae T 7 16 
Couigd 28 il 7 
err /R 1 18 
GM iL Zi 
GMT 38 Zo 
is Zl ou 
MS 2 41 
Pi 33 By 
PS 4 Bye 
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OW 1 Tl 
SWSE JE Ta 
at 4 81 
TMS 34 82 
peer 18 83 
UFT 17 vil 


Variable K, Reason for Interview 


Reason Frequency Codes 
Int B inte Gis) 79 1 
Stmt Ps Hs (Sr) sles Z 
Not Conducted 216 a) 
Required 118 4 
Unfav. Inf 76 5 


Variable Ll, Interview Information #1 


Information Frequency Codes 
Bree Ene 1 1000 
Burglary 2 1001 
Robbery 1 1042 
Theft 6 1043 
sus Theft 2 1044 
Assault 5 1050 
Rape 1 152 
Alcohol Pr 4 L19¢ 
DUP 1 2 eT 
Liq Law Vi il 1104 
Mim vA o Z GS 
PubIntox>1 1 Oy 
Marij Arr eS) 1120 
Mar ijgeae da ie 
Mari ieo-es 6 Liz2 
Martie 2s 2 iW e72 3} 
Susp Drug 2 1140 
DrgUse 1-5 7 1141 
Credit 4 2519, 
Bad Checks Z 1230 
Unpd Bills IS) 1233 
Homo Susp 2 1422 
Tresspass 1 SS EW, 
Fired IS) 1561 
Suspnd Sch 7 1564 
susCommSym 2 1600 
Depression i Ul ee25) 
Leave Acct 2 1804 
Mov Veh 1 8 1810 
Mov Veh2-5 5 1811 
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Mov Veh >5 1 1812 
Dmg Per Pr 2 1841 
Juven Rec 6 1843 
Runaway 2 1845 
Voyeur 1 1847 
wob NotSPH 2 197 
No Informa 368 9999 


Variable L2, Interview Information #2 


Information Frequency Codes 
Pek & Ent 2 1000 
Burglary 1 1001 
Theft 5 1043 
Assault Z 1050 
Battery 1 TOS! 
Alcohol Pr ial 1100 
poy 1 i 1101 
wo >) 1 PEO2 
ian Alc Po 2 LOS 
PubIntox 1 1 1106 
Marij Arr 4 1120 
ierij 1-5 14 T1271 
Marij 6-25 1 Alas 72 
Susp Drug 1 1140 
DrgUse 1-5 4 1141 
Sale Drugs S 1150 
Drug Waivr 2 1160 
Credit 2 1210 
Bad Checks 3 1230 
sued Nonpa 1 22 
Mopa Bills 5 1235 
Veh Reposs 1 1225 
Homo Admit IL 1421 
Disrespect 1 ISSN, 
Disag Empl 2 152 
Perf Unsat 1 1538 
Disenr Sch il 1540 
Fired 10 56 1 
Suspnd Sch 6 1564 
Absnt Freq ) 1570 
Immat Beha 1 ye, 
Emot Prob 1 1701 
Depression 1 L239 
Leave Acct il 1804 
Mov Veh 1 5 1610 
Mov Veh2-5 i dleey l 
Stnd Veh 1 2 LAs 
Disord Con 5 1821 
Dstb Peace 1 ie22 
Harrassmnt 1 le23 
Lisc Suspd 3 1825 
Crim Misch 2 1840 
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Juven Rec 5 1843 
Mal Misenet i! 1844 
Runaway J 1845 
Fraud Enli i 1922 
Job NotSPH 3 1923 


Variable L3, Interview Information #3 


Information Frequency Codes 
Robbery 6 1043 
Sus Theft i) 1044 
Assault 2 1050 
Incorrigbl i aerial 
Alcohol Pr 6 LLCO 
DUI 1 1 1.07) 
Drnk & Dis i L103 
Liq Law Vi 1 are 
Min Alc Po 5 IONS 
Marij Arr 3 E20 
Mari; 1-5 5 1121 
DrgUse 1-5 1 1141 
Unpd Bills Z WES: 
Juv Homo at WES 
Perf Unsat 1 lac 
Fired if 1561 
Absnt Freq i) 1570 
Tardy 1 a7 7 
SusCommSym 1 1600 
Mov Veh 1 6 1810 
Dstb Peace 1 eZ 
Lise Suspd J Tez 5 
Vandalism 1 nhs) 28 
Cram Mrsen 1 1840 
Dmg Per Pr 1 1841 
Juven Rec 6) 1843 
Mal Mischf i 1844 
Add NotSPH it 1920 
Job NotSPH i 1923 
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Variable L4, Interview Information 


Information Frequency Codes 
Robbery a: 1043 
Lied 1 1071 
Alcohol Pr Z 1100 
DUI a i Te 1 
ubsem le, lee i 1S 
Fup dintox) 1 1106 
Mari ei-> 2, daz 1 
Susp Drug 1 1140 


BeeUse 1-5 5 1141 
Drug Waivr i 1160 
Homo Susp 1 1422 
Fem Impers 1 1430 
Un Absense I SU) Ih 
Fired i aed 
Emot Prob 1 1700) 
Mov Veh2-5 1 1811 
Lisc Suspd 1 iez> 
Juven Rec 1 1843 
Mal Mischf 1 1844 


Variable Ml, NAC Checks Findings #1 


Prcormation “Frequency Codes 
Brk & Ent il 1000 
Burglary 2 E80) 
Forgery 1 1010 
Theft 4 1043 
Assault 1 1050 
Rape 1 TOS2 
POL 1 2 1101 
Marij Arr Z 1120 
ard} 1-5 i 1121 
DrgUse 1-5 1 1141 
Bad Checks 2 1260 
Homo Susp 1 1422 
Vio Parole ih 1502 
Not El Reh 1 1537 
Tng Dischg i 1543 
Mov Veh 1 1 1810 
Disord Con I 1821 
Harrassmnt 1 TeZzs 
Dmg Per Pr iL 1841 
No Informa 538 9999 


Variable M2, National Agency Check Findings 


Information Frequency Codes 
Burglary 4 1001 
Robbery 1 1042 
Theft 1. 1043 
DUI 1 1 1101 
Tresspass 1 ies 
Reck Drive I. 1805 
Disord Con 1 1821 
Juven Rec i 1843 
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Variable Nl, Local Agency Check Findings #1 


Information Frequency Codes 
Burglary 6 1001 
Larceny 1 1040 
Theft Hal 1043 
Sus Theft i 1044 
Assault 3 1050 
Rape i LOS 2 
Aléochol Pr i) TIGO 
IEC Lael 6 nai 
DUI>1 2 1102 
Lig Law Vi 5 1104 
Min Alc Po 10 1105 
Publntox, 1 3 1106 
PubIntox>1 1 LO? 
Maida Axo 4 InkZ 0 
Marij 1-5 2 lay 
Marij 6-25 i Taz 
DrgUse 1-5 2 1141 
Bad Checks 5 1230 
Sued Nonpa Z LS 2 
Unpd Bills 1 1233 
Tresspass Z ISS, 
Hit @enun 1 1803 
Leave Acct 5 1804 
Reck Drive Ss) 1805 
Mov Veh 1 45 ale) 39 
Mov Veh2-5 208 130) 
Mov Veh >5 3 1812 
Stnd Veh 1 il Tei3 
Disord Con 2 1821 
Crim siaseh 1 1840 
Dmg Per Pr 2 1841 
Juven Rec 31 1843 
No Informa 373 9999 


Variable N2, Local Agency Check Findings #2 


Information Frequency Codes 
Brk & Ent 5 1000 
Burglary 1 1G. 
Theft a 1043 
Battery 3 Osa 
Incor.i12bil i 1070 
Alcohol Pr 2 1100 
DOT 4 1101 
DUI>1 1 LiG2 
Liq Law Vi 1 1104 
Min Alc Po 1 1105 
PubIntox 1 2 1106 
Marij Arr 1 2 © 


60 


Mari} 1-5 4 bol 
faci) 6-25 1 al ay 
DrgUse 1-5 1 1141 
Bad Checks 1 L230 
No Restit 1 E250 
sued Nonpa 1 hz 2 
Homo Susp 1 1422 
Vio Parole 1 mele 
Fail Aper 10 572 
Leave Acct 1 1804 
Reck Drive S 1805 
Mov Veh 1 8 ve. 0 
Mov Veh2-5 7 1811 
Stnd Veh 1 i! Loi3 
Pisord Con 2. be24 
Dstb Peace 1 1822 
Harrassmnt 1 1623 
Il Firearm if 1824 
Lisc Suspd 8 a2 5 
Out Warrnt 2 1828 
Crim Misch 2 1840 
Dmg Per Pr il 1841 
Juven Rec 6 1843 
Mal Mischf 1 1844 
Voyeur 1 1847 


Variable N3, Local Agency Check Findings #3 


Information Frequency Codes 
Burglary 1 1001 
Larceny zZ 1040 
Theft 4 1043 
Assault 3 1050 
Alcohol Pr i 1100 
DUI 1 Z 1101 
Liq Law Vi 1 1104 
PubIntox 1 Z 1106 
DrgUse 1-5 1 1141 
Sale Drugs 1 LESO 
Fem Impers 1 1430 
Tresspass 2 15349 
Fail Aper 1 105 a7 
Mov Veh 1 6 1810 
Mov Veh2-5 5 1811 
Mov Veh >5 1 FoA2 
Disord Con 3 1821 
Il Firearm 1 1824 
Lisc Suspd 3 1825 
Juven Rec 1 1843 


Variable N4, Local Agency Check Findings #4 
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Information Frequency Codes 


Burglary 1 1001 
Larceny 1 1040 
Feel Prop i 1041 
Theft 1 1043 
Drnk & Dis i. EO 3 
Liq Law Vi i) 1104 
Marij Arr IL 1120 
susp Drug 1 1140 
Unpd Bills 1 1233 
Fail Aper 1 ey 2 
Leave Acct il 1804 
Mov Veh2-5 2 1811 
Harrassmnt il 1823 
Juven Rec if 1843 


Variable O1, Credit Bureau Check Findings #1 


Information Frequency Codes 
Bad Checks 3 1230 
Unpd Bills ZZ gE 89) 
Veh Reposs il P2355 
No Informa 538 9399 


Variable 02, Credit Bureau Check Findings #2 


Information Frequency Codes 
Bad Checks 1 ab S40) 
No Restit il iZ2S1 
Unpd Bills Zz £253 
Bankruptcy 1 i250 


Variable P, High School # of Sources 


# of Sources Frequency Codes 


Variable Q1, High School Recommendations 


Type Recomm. Frequency Codes 
1 Recomm 116 ileal 
4 Recomm 85 12 
3-5 Recomm ae is 
6-10Recomm 1 14 
1 RecWSu 2 21 
1 DecCom 1 Si 
3-5 DecCom il 35 
1 NotRec 7 | 
2 NotRec 2 42 
No Intervw 290 99 


Variable Q2, H.S. Recommendation #2 


Type Recomm. Frequency Codes 
IL Recomm 3 2il 
1 RecWSu 10 Bal 
2 RecWSu 1 BZ 
1 NotRec 6 a) 
2 NotRec 2 42 


Variable Q3, HS Recommendations #3 


Type Recomm. Frequency Codes 
1 RecWSu 2 om 
ih DecCom 3 Suh 
Ms DecCom i} 41 
2 NotRec 2 42 


Variable R1, HS Diploma Status 


Diploma Status Frequency Codes 
Dipl Grad 454 1 
Non Grad 81 2 
Gen Eq Dip 28, 3 


Variable R2, High School Findings #1 


Information Frequency Codes 


63 


Assault 1 1050 
incermmae D1 1 1070 
Alcohols Pr S IKeKe 
DOL 1 1 1102 
Mardaqe 1-5 S ial 2a 
Susp Drug 2 1140 
DrgUse 1-5 1 1141 
Insubordin 2 Sis. 
Perf Unsat 1 i538 
Lack Motiv 1 13563 
Suspnd Sch 2 1564 
Absnt Freq I 1570 
Braggart i} 171 
Immat Beha 10 L735 
Needs Supv I iy 
Emot Stab? i 1702 
Clmd DipIim § 22 1 
No Informa 6 Se e)s, 


Variable R3, High School Findings #2 


Information Frequency Codes 
Theft 1 1043 
Dishonest z 1060 
Mari} 1=5 1 Tae 
Susp Drug 2 1140 
DrgUse 1-5 2 1141 
Disrespect 1 1530 
Insubordin 1 Lol, 
Tng Dischg I 1543 
Lack Motiv 3 1563 
Suspnd Sch > 1564 
Absnt Freq pi 1570 
Braggart i 1S) 7 al 
Tardy 3 Tay 7 
Undependb] if 1578 
Emot Stab? 1 UFO 
Vandalism 1 ne 27 


Variable R4, High School Findings #3 


tnformation Frequency Codes 
Theft tL 1043 
Alcohol Pr 1 1100 
Susp Drug Z 1140 
Disrespect 1 1530 
Expelled 1 1560 
Absnt Freq I Ly 0 
Undependb1l] i 1576 
Emot Stab? Z 1702 
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Clmd Diplm 1 192 


Variable S, College # of Sources 


# of Sources Frequency Codes 
0 547 0 
ik a 1 
2 5 Z 
a Z 3 
4 1 4 
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Variable T, College Recommendations 


Type Recomm. Frequency Codes 
i Recomm 8 ila 
2 Recomm 5 12 
3-5 Recomm 2 1s) 
1 DecCom i on 
No Intervw 548 99 


Variable U, College Findings 


Information Frequency Codes 
Assault 1 1050 
Lied 1 Lowel 
Unpd Bills 2 bas 35 
Empl False 1 1924 
No Intervw Soy) W929 


Variable V, Employer # of Sources 


# of Sources Frequency Codes 


oa 
LL s\Z 
109 
i> 
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Variable W, Co-worker # of Sources 


## of Sources Frequency Codes 


0 220 0 
Th 118 ] 
Z 58 2 
2) 24 3 
4 4 
5 5 
6 6 


Variable X1, Co-worker Recommendations #1 


Type Recomm. Frequency Codes 


1 Recomm 89 11 
2 Recomm 91 Ny 
3-5 Recomm 162 1 
6-10Recomm 16 14 
11+ Recomm 1 eS 
2 RecWSu i Ze 
il DecCom ip? Gl 
2 DecCom 6 a2 
3-5 DecCom 2 ae 
1 NotRec 8 4 
2 NotRec i 42 
3-5 NotRec 1 43 
No Intervw 174 99 


Variable X2, Employment Recommendations #2 


Type Recomm. Frequency Codes 


1 Recomm 1 bit 
Z Recomm 1 ie 
3-5 Recomm ) IL,2) 
ii RecWSu it Pal 
Z RecWSu 1 Ze 
il DecCom 9) pal 
Z DecCom Ws: Sy 
3-5 DecCom 3 33 
1 NotRec ile 4l 
2 NotRec 4 42 
3-5 NotRec 4 43 
6-10NotRec i 44 


Variable X3, Employment Recommendations #3 


Type Recomm. Frequency Codes 


i DecCom 6 31 
6-10DecCom 1 34 
a NotRec 3 al 
2 NotRec 2 ED) 


Variable Yl, Employment Findings #1 


Maximum 
Information Frequency Codes 
Theft 1 1043 
Sus Theft 3 1044 
Lied 2 1071 
Alcohol Pr 2 1100 
Marij Arr 1 1120 
itaeij 1-5 1 1121 
Susp Drugs 2 1140 
DrgUse 1-5 il 1141 
Bad Checks ih 1230 
Disrespect 1 1530 
Not El Reh 18 1537 
Perf Unsat 3 1538 
Fired 77 1561 
Freq Quit y 1562 
Absnt Freq 7 7 0 
Immat Beha 8 Vays, 
Lack Judge 1 1574 
Needs Supv 1 1575 
Tardy 1 i577 
Undependb1l 5 1578 
Juven Rec 1 1843 
Job NotSPH 5 1923 
Time Unact 20 1924 
Empl False 3 Ne) 57 
No Intervw 82 SEIS, 


Variable Y2, Employment Findings #2 


Information Frequency Codes 
Theft 5 1043 
Sus Theft 1 1044 
Assault 1 1050 
Alcohol Pr a 1100 
aed) 1-5 5 12 
DrgUse 1-5 1 1141 
Unpd Bills 1 Loo 
Disrespect 1 1530 
Disag Em 3 1532 
Emplt Prob 1 1538 
Insubordin 1 1535 
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Misconduct 1 £536 
Not El Reh 8 P57 
Perf Unsat ri i5o0 
Fired 7 156 i 
Freq Quit 1 1562 
Lack Motiv 5 1563 
Absnt Freq 5/8: F570 
Immat Beha 6 1573 
Tardy 1 L577 
Undependb1 4 1578 
Emot Prob 1 17 07 
Emot Stab? 1 1702 
Empl False 3 1924 


Variable Y3, Employment Findings #3 


Information Frequency Codes 
Forgery i 1010 
There 3 1043 
Sus Theft Z 1044 
Alcohol Pr 4 LOO 
Mari; 2-5 1 P21 
Susp Drugs 1 1140 
Bad Checks 1 12350 
Homo Susp i 1422 
Disrespect 1 io 
Insubordin il lS) o5) 
Not El Reh 6 lee / 
Perf Unsat 1 1538 
Tresspass 2 1559 
Fired 1 i Souk 
Lack Motiv i I BioS} 
Absnt Freq > 1570 
Braggart il iy 1 
Immat Beha s 13 
Tardy 5 157 
Undependbl 2 1578 
Emot Stab? 1 1702 
Nerv Condn 1 ihe 
Empl False 3 1924 


Variable Y4, Employment Findings #4 


Information Frequency Codes 
Lied 1 LO71 
we toos los 2 iZ 1 
Susp Drugs 1 1140 
Homo Susp 1 1422 
Insubordin 1 1535 
Not El Reh 1 Ie 
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Immat Beha 2 ines! 
Undependbl iL ales 18) 
Com Threat 1 1820 


Variable Z, Neighborhood # of Sources 


# of Sources Frequency Codes 


Variable AA1, SPH Listed Neighborhood Source Recommendations 


Type Recomm. Frequency Codes 
I Recomm oo it 
2 Recomm 81 le 
3-5 Recomm 76 13 
6-10Recomm S he 
1 DecCom il ol 
1 NotRec Z 41 
No Intervw S16 99 


Variable AA2, Developed Neighborhood Recommendations #1 


Type Recomm. Frequency Codes 
il Recomm sie 11 
2 Recomn 50 1 
3-5 Recomm 46 13 
6-10Recomm 2 14 
1 RecWsu 1 Zl 
1 DecCom Ly eal 
2 DecCom 8 32 
3-5 DecCom C 30 
ik NotRec 5 G1 
None Devel 400 99 


Variable AA3, Developed Neighborhood Recommendations #2 


Type Recomm. Frequency Codes 
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I DecCom 16, on 
2 DecCom Ze a2 
3-5 DecCom 2 33 
1 NotRec 1 41 


Variable AB1, Neighborhood Findings #1 


Information Frequency Codes 
Theft 1 1043 
Wild (Beh) 1 IOS aL 
Alcohol Pr 1 IL ANOG, 
DUI 1 1 IO} 
Mara jeel=5 5 aa 
Sus Theft 1 1140 
DrgUse 1-5 1 1141 
Unpd Bills 3 1233 
Disrespect 1 30 
Perf Unsat i isc 
Tresspass Z 1539 
Lack Motiv | 1563 
Braggart 1 S71 
Immat Beha 4 lay 3 
Undependb] 2 i578 
Suicide At i 1704 
Juven Rec 1 1843 
Mal Mischf il 1844 
Voyeur i 1847 
No Intervw 534 9999 


Variable AB2, Neighborhood Findings #2 


Information Frequency Codes 
Bre e& Ent 1 1000 
Theft i 1043 
Wild (Beh) 1 Ore a 
Alcohol Pr il 1100 
Susp Drugs 5 1140 
DrgUse 1-5 2 1141 
Unpd Bills Z 1232 
Ateteud ser Z i530 
Undependb]1 if MS Tes, 
Depression 1 lig age 


Variable AB3, Neighborhood Findings #3 


Information Frequency Codes 


Burglary 1 1001 
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Marij 1-5 1 Lie 
Juven Rec 1 1843 


Variable AC, Other # of Sources 


# of Sources Frequency Codes 
0 205 0 

98 1 
2 107 ja 
3 86 3 
d, 4.2 4, 
5 2 5 
6 6 
7. 7 
8 8 
7 1 


Variable AD1, Other Recommendations #1 


Type Recomm. Frequency Codes 
1 Recomm 95 aa! 
2 Recomm FOS 12 
3-5 Recomm 126 13 
6-10Recomm 10 14 
1 RecWsSu 3 Zu 
1 DecCom 4 oy Al 
2 DecCom 1 a7 
1 NotRec 5 41 
2 NotRec 3 en 
3-5 NotRec 7 43 
No Intervw 205 99 


Variable AD2, Other Recommendations #2 


Type Recomm. Frequency Codes 
i Recomm Zz Val 
2 Recomm Z 12 
1 DecCom 5 Sal 
3-5 DecCom 1 30 
1 NotRec 10 fal 
2 NotRec 2 42 
11+ NotRec 1 45 


Variable AD3, Other Recommendations #3 


I 


Type Recomm Frequency Codes 


Variable AE1, Other Findings #1 


Information Frequency Codes 
Brk & Ent 1 1000 
Keotolerop i 1041 
Theft 1 1043 
Sus Theft 2 1043 
Lied 2 ILO} 7001 
Alcohol Pr 2 1100 
Min Alc Po 1 TECS 
Marij Arr 1 1120 
Marij 1-5 12 121 
Susp Drugs = 1140 
DrgUse 1-5 1 1141 
Bad Checks 2 1230 
Unpd Bills 2 1233 
Disq Nuc Z 160! 
Lack Motiv 1 1568 
Absnt Freq ] 1570 
Braggart it WS) 7 
Immat Beha 6 1573 
Needs Supv i 1d 5 
Undependb1 4, 1578 
Emot Prob 1 1701 
Emot Stab? 1 1702 
Psyche Top Ih 1703 
Anxiety i! 1710 
Juven Delgq 1 1842 
Juven Rec 4 1843 
Time Unact 1 127 
No Intervw 5107 9995 


Variable AE2, Other Findings #2 


Information Frequency Codes 
R StolProp it 1041 
Theft 4, 1043 
Lied 4, 171 
Wild (Beh) IL Te91 
Alcohol Pr 6 1100 
DUI 1 1 IAG 
Min? Ale Po 1 1 ANOS 
Maij 1-5 1 a 
Susp Drugs IL 1140 
DrgUse 1-5 3 1141 
Sale Drugs 1 IESG 


No Resti uf 13 1 
Homo Susp 1 1422 
Disrespect 1 1530 
Suspnd Sch I 1564 
Absnt Freq 2 1570 
Braggart 1 17 1 
Immat Beha I 15/3 
Easi Upset 3 1700 
Emot Stab? 2 170 2 
Suicide At iL 1704 
Depression if Wi 3 
Runaway 1 1845 


Variable AE3, Other Findings #3 


Information Frequency Codes 
Theft 3 1043 
Dishonest 1 1060 
Violent 1 1090 
Alcohol Pr i 1100 
Mari} 1-5 2 Tei | 
Susp Drugs 2 1140 
Fired 1 16 1 
Immat Beha 3 1573 
Reckl Cond J 1576 
Undependb] 3 1578 
Emot Stab? 3 1702 
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Variable AE4, Other Findings #4 


Information Frequency Codes 
Forgery Z 1010 
Theft 1 1043 
Assault 1 1050 
Battery 1 1051 
feonol Pr 1 1100 
Unpd Bills 1 LZao 
Lack Motiv 1 1563 
Immat Beha 1 LSS 
Emot Prob i 170} 
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Race Frequency Codes 
Caucasian 397, 1 
Black 50 Z 
Hispanic Tal 3 


qs 


Oriental 2 4 
Unknown mers 5 


Variable AG, Marital Status 


Status Frequency Codes 
Unmarried Sy 1 
Married Si: 2 
Divorced 14 4 


Variable AJ, # of Dependents 


# of Dependents Frequency Codes 


aye jh 
30 


Variable AN, # of Siblings 


# of Siblings Frequency Codes 
0 Siblings 24 0 
1 Siblings 73 iL 
2 Siblings 118 Us 
Seo lp lings 125 3 
4 Siblings 82 4 
> oi lines 56 5 
6 Siblings 14 6 
7 Siblings I i 
8 Siblings 9 8 
9 Siblings 4 a 
TOM Salo lie: 4 10 
12 Sibling 1 2 
Unkn # Sib ne, 50 


Variable AO, Permanent Residence 


Alabama 
Arizona 
Arkansas 
Gal vuborniva 
Colorado 
Connecticu 


Frequency Codes 
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DC a Vd 


Florida 24 12 
Georgia Dil 13 
Hawaii 2 i> 
Idaho 3 16 
Tllinois 20 17 
Indiana ales. 18 
Iowa 7 19 
Kansas 2 20 
Kentucky 7 Za 
Louisiana 6 22 
Maine 3 uF 
Maryland 10 24 
Massachuse 20 2 
Michigan 24 26 
Minnesota 4 27. 
Mississipp ji ae 
Missouri 14 29 
Nebraska 3 Sal 
Nevada 5 ae 
New Hampsh iu Se 
New Jersey Ihe, 34 
New Mexico 5 35 
New York 41 36 
North Caro ilies 37 
Ohio 34 39 
Oklahoma 9 40 
Oregon 6 41 
Pennsylvan 45 42 
Puerto Ric 1 43 
Rhode Isla 5 44 
South Caro lS 45 
South Dako 5 46 
Tennessee 9 ay 
Texas 36 48 
Virginia 19 ol 
Washington 6 ys 
West Virgi 8 54 
Wisconsin 9 55 
Wyoming 3 56 


Variable AR, Age at enlistment 


Age Frequency Codes 
0 i 0 
le: 7 1 lo. 7 
ub?! 7 7.0 
ie | 2 vies A 
i. 2 3 Vig @ 
i. 3 y i. 3 
17.4 8 17.4 
im. 6 7 7. 6 
ie. 7 9 7, 


US 


WO COrsxIa UN & WP ee 


WO ONAIDMN Whe 


OMNI ANAMNON FH 


OMIA WHF 


WH Fe 


\O © 


OMmOnAMNE WHF 


I~ DOD DP NF WNHOEH FMB WWD WW WANK KF ODWIOW IAW OND J ~ 


HPWH DUO MNYIHNF WHR DOU WYANWNF OW WYNIAHANFWNHKFOWWNYNANLFWNHF OW WYIAMNFWNHEH OO OO 


76 


294. 1 2255 
25,0 1 250 
Ths oe 2 Zoe 
24.1 2 24.1 
24.2 Z 24.2 
24.4 1 24.4 
24.8 I. 24.8 
25 1 2on0 
Ze. 3 2 Loe 
Za. 2 1 a. 5 
Ze. © 3 Ze,6 
Zz. 8 1 Zone 
ey 1, 27-0 
uy. 1 1 oy. | 
ee 9 1 a. 9 
28 1 Zo. 0 
Zo. 1 1 Zo. 1 
Zo. 3 1 Zo. 3 
28.4 i 28.4 
Zo. 3 il 2 ae 
29.5 ih Zon S 
29.8 1 2220 
31.4 1 31.4 


Variable AS, Months Between HS and Enlistment 


Months Frequency Codes 
0 48 0 
iL 52 1 
2 40 2 
3 30 S 
4 Ze 4 
5 10 5 
6 20 6 
7 22 i 
8 Zs 8 
9 14 9 

10 16 10 
al 10 ipl: 
2 7 eZ 
iS i 13 
14 hal 14 
as. 16 15 
16 6 16 
iy 9 17 
18 9 18 
13 5 12 
20 8 20 
JA 9 7A 
Z2 6 22 
18, 3 23 
24 9 24 
Z5 8 25 


Wa 


26 - 26 
27 3 27 
28 Z 28 
29 é 2g 
30 Z oo 
Sok 5 oa 
a2 4 32 
Gs Z 32 
34 8 34 
35 6 35 
36 6 SS 
a7 6 se)! 
38 tl 38 
$y) 4 oy) 
4,1 3 4,1 
2 5) 42 
43 4 43 
+4 6 44 
45 3 45 
46 2 46 
47 Z 47 
48 4 48 
49 Z 49 
50 + 50 
Sy + a2 
54 2 54 
56 2 56 
a th 57 
58 3 SNS: 
60 4 60 
61 1 61 
62 ] 62 
64 1 64 
65 1 65 
66 Ih 66 
68 2 68 
ae IL 72 
74 2 74 
79 2 29 
80 1 80 
82 i 82 
87 i 87 
G2 1 Oe 
oF Z OF 
98 1 98 
102 g 103 
105 1 105 
108 1 108 
17 A ] 21 
LS. 1 123 
124 i 124 
i 3 ns 
134 ih 134 
158 1 sys: 


78 


Variable AT, Jobs Between HS and Enlistment 


maot Jobs Frequency Codes 
0 149 0 
it 157. if 
D 90 Z 
3 64 3 
4 38 4 
5 30 5 
6 2 6 
i 9 7 
8 6 8 
9 2 9 
al il Wal 
IZ 1 12 
15 Z ES: 
16 zZ 16 
7 1 17 


Variable AU, Months Unemployed HS to Enlistment 


# of Months Frequency Codes 
0 Toye 0 
il yal 1 
2 58 2 
3 29 S. 
4 23 4 
5 26 5 
6 16 6 
i] 18 7 
8 19 8 
2 16 9 

10 13 10 
ital 4 el 
12 6 TZ 
3 7 is 
14 eZ 14 
1S) 4 15 
16 8 16 
17 6 17 
18 4 18 
1s, 4 mM 
20 2 20 
oA 7 Za 
a2 Z 22 
23 2 23 
24 2 24 
ZS 6 25 
26 iL 26 
27 2 Le 
28 3 28 
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29 1 23 
onl 1 oul 
De 1 a2 
59 2 oye 
35 Zz 5 
36 i 36 
37 1 a7 
38 5 38 
42 1 42 
+4 1 44 
45 3 45 
46 2 46 
=) It 1 Dil 
D2 1 ye 
58 2 58 
ON) 1 5), 
71 1 VI 
G2 1 92 
106 1 106 


Variable AV, # Months College Attended 


## of Months Frequency Codes 
0 465 0 
1 3 i 
2 7 2 
6 6 Ss) 
4 6 4 
5 4 5 
6 5) 6 
7 5 7 
8 2 8 
9 14 g 

10 2 10 
ial il IL 
I 2 7 2 
13 Z 13 
14 2 14 
Ls 2 15 
16 2 16 
18 14 18 
20 i 20 
ou A. Jo) 
Jaf 1 22 
24 1 24 
26 I] 26 
IAG} if 27 
28 1 es 
29 i 20 
36 2 36 
42 Z 42 
44 | 44, 
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Var AW, # Months Unemployed Immediately Prior to Enlistment 


## of Months Frequency Codes 
0 yay | 0 
1 118 1 
2 54 yi 
s) 34 3 
4 26 4 
5 20 5 
6 18 6 
7 10 7 
8 9 8 
9 8 9 

10 i 10 
11 2 11 
1 4 ZZ 
1186 3 6: 
14 3 14 
15 2 IS 
16 2 16 
20 1 20 
Ail 1 ya 
24 1 24 
26 1 26 
28 1 28 
22 1 2 

BS i Se 


Variable AX1, Unfavorable Information on SPH #1 


Information Frequency Codes 
Brk & Ent S 1000 
Burglary 3 1001 
Larceny i 1040 
Robbery 1 1042 
Theft Gs 1043 
Assault 6 1050 
DOT 1 6 Wey] 
DUI >1 1 T1G¢2 
Drnk & Dis 1 LG 
Liq Law Vi 5 1104 
Min Alc Po 7 1105 
Pupiaitox 1 2 1106 
Marij Arr 1, LiZe 
Marij 1-5 12 vel 2 
Pera) >25 if 1s 
DrgUse 1-5 1 1141 
GED 1 1510 
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Emp lt” Prob 1 Sy SYS: 
Tresspass 4 i559 
Fired 34 Sites.) 
Freq Quit 1 > 62 
Suspnd Sch 1 1564 
Undependbl 1) 1578 
SusCommSym 1 1600 
Leave Acct 2 1804 
Reck Drive 1 P3805 
Mov Veh 1 50 1810 
Mov Veh2-5 54 es JI 
Mov Veh >5 1 1802 
Stnd Veh 1 2 1813 
StndVeh2-5 2 1814 
Disord Con Z 1821 
Dstb Peace i 1822 
Harrassmnt 1 1823 
Lisc Suspd il nS25 
Loitering Z 1826 
Misdemeanr 3 1627 
Vandalism 1 1829 
Juven Delq ih 1842 
Juven Rec 29 1843 
Runaway ] 1845 
Empl False Z 1924 
Time Unact IL 127 
Blank 293 9999 
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Variable AX2, Unfavorable Information on SPH #2 


Information Frequency Codes 
Bakar © Gite 3 1000 
Burglary 1 1001 
Theft ib 1043 
Sus Theft ih 1044 
Assault 2 1050 
Battery 5 IOI) 1 
imeon 4 gpl 1 1070 
Alcohol Pr 2 1100 
DUI 1 5 1101 
Liq Law Vi 3 1104 
Min Ale Po 4 1 Os: 
Pup ilimtex. 1 2 1106 
Marie arr i) 1120 
Mardy) 1=5 Z 12 1 
HalicdO= 25 1 22 
Emplt Prob il 1533 
Per Unsat iL 36 
Tresspass 6 1, Se. 
Fired 3 6 | 
Freq Quit 2 1562 
Absnt Freq 4 1570 
Reck Drive i 1805 


Mov Veh 1 19 1810 
Mov Veh2-5 12 1811 
Mov Veh >5 1 1812 
Disord Con 3 1621 
Dstb Peace 3 1822 
Il Firearm 1 1824 
Lisc Suspd 6 1825 
Misdemeanr 1 1827 
Grim Misch i 1840 
Juven Rec 4 1843 
Mal Mischf 2 1844 
Empl False 3 1924 


Variable AX3, Unfavorable Information on SPH #3 


Information Frequency Codes 
Larceny Z 1040 
Assault 1 1050 
Alcohol Pr 1 1100 
DUI 1 1 1101 
Marij Arr 2 1120 
Marij 1-5 5. TZ 
DrgUse 1-5 1 1141 
Non HS Grd 1 1511 
Disag Empl iL 1532 
Tresspass 1 MSS iy 
Absnt Freg 1 1570 
Hit & Run 1 1803 
Reck Drive Z 1805 
Mov Veh 1 7 1810 
Mov Veh2-5 Z 1811 
Stnd Veh 1 1 1813 
Disord Con 1 Gia 
Lisc Suspd 1 1825 
Dmg Per Pr 1 1841 


Variable AX4, Unfavorable Information on SPH #4 


Information Frequency Codes 
DUT 1 1 eaeRl 
Min Alc Po 1 1105 
Foolntox 1 1 1106 
Bad Checks it 1230 
ContDelMin 1 1410 
Leave Acct 1, 1804 
Mov Veh 1 1 1810 
Mov Veh2-5 1 1811 
Misdemeanr 1 1827 
Juven Rec 1 1843 
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Male iisent iL 1844 


Variable AY1, Summary of BI #1 


Information Frequency Codes 
Bar & Ent i 1000 
Burglary 4 1001 
Larceny 2 1040 
Robbery 1 1042 
Tmett 9 1043 
Sus Theft 5 1044 
Assault ) 1050 
Rape i) TOS2 
Lied 3 1071 
Violent 1 1090 
Meroe V5 3 1100 
DUI 1 6 1101 
DUI >1 1 TZ 
Liq Law Vi il 1104 
Min Alc Po 2 1195 
Marij Arr iz 1120 
Maas djaele 18 TE 
Marij 6-25 Zz Tae 
Manet” 225 Z TAS 
Susp Drug 3 1140 
DrgUse 1-5 =e 1141 
Credit 2 1210 
Bad Checks 2) ie 
Unpd Bills 14 T2755 
Homo Susp i, 1422 
GED 6 1510 
Non HS Grd 18 bs tha 
Abtaed, Pr 1h 1530 
Emplt Prob 4 1333 
Fail Obey 1 oe 
Insubordin 1 12> 
Not El Reh 1 1537 
Perf Unsat 2 1538 
Tresspass S: 139 
Tng Dischg 1 1543 
Fired 26 oie! 
Freq Quit 4 1562 
Lack Motiv i 1563 
Suspnd Sch 4 1564 
Absnt Freq 3 Lo70 
Immat Beha Lal 1573 
Lack Judge 1, 1574 
Needs Supv I ILS) 5) 
Undependbl 3 1578 
Emot Prob 2 Po 
Emot Stab? 3 2 
Leave Acct 1 1804 
Reck Drive IL 1805 
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Mov Veh 1 2 1810 
Mov Veh2-5 ie Fenet 
Mov Veh >5 2 Tea 
Runaway 2 ro20 
Lisc Suspd 1 1825 
Misdemeanr 1 1827 
Out Warrnt 1 1828 
Dmg Per Pr 1 1841 
Juven Delq iL 1842 
Juven Rec 20 1843 
Uncont® Juv 1 1846 
AddrNotSPH i 1920 
Job NotSPH 3 1923 
Empl False 21 1924 
Time Unact 3 1927 
No Der Inf 282 9999 


Variable AY2, Summary of BI #2 


Information Frequency Codes 
pee eebnt 3 1000 
Burglary 1 1001 
Forgery i 1070 
Theft 10 1043 
Sus Theft 1 1044 
Assault 5 1050 
Battery 1 10a 
Incorrigbl 1 OAC 
Lied 4 1071 
Alcohol Pr 10 1100 
wot 1 5 iLO 
DUI >1 i 1102 
Liq Law Vi 1 1104 
Mim Alc Po 1 IGS 
Eubintox 1 1 1106 
Petea)) Arr 2 120 
iema 4 1-5 12 bball 
Susp Drugs Z 1140 
DrgUse 1-5 5 1141 
Sale Drugs 4, i510) 
Drug Waivr 1 1160 
Bad Checks 1 75510 
No Restit i 1231 
Sued Nonpa 1 Tae 2 
Unpd Bills 9 W253 
Unpd Taxes 1 1234 
Bankruptcy 1 1250 
Homo Admit 1 1421 
Homo Susp 1 1422 
Vio Parole Z E02 
GED 1 1510 
Non HS Grd ik 11 
erta Pr 3 Le Sie 


EMp it. eoD 1 lses 
Not El Reh Z LS SF 
Perf Unsat 5 [536 
Tresspass 2 1539 
Fired 18 Sioa 
Freq Quit i 1562 
Suspd Sch 1 1564 
Absnt Freq 10 1570 
Braggart 1 1s) 7 A 
Fail Aper 4, 2 
Immat Beha 1 ISS: 
Needs Supv 1 SS 
Undependb1 g 1S) Hie 
Emmet er nop i 1701 
Emot Stab? 2 1702 
Anxiety IL Tako 
Depression Je les 
Reck Drive 1 1805 
Mov Veh 1 10 1810 
Mov Veh2-5 4 lie IU I 
Mov Veh >5 I aes 2 
StndVeh2-5 I. 1814 
Lisc Suspd 1 1825 
Crim Misch 1 1840 
Dmg Per Pr 6 1841 
Juven Rec 5 1843 
Voyeur 1 1847 
Add NotSPH 1 1920 
Clmd Diplm 2 192 
Fraud Enli 1 1922 
Job NotSPH 1 IZ 
Empl False 3 1924 
Variable AY3, Summary of BI #3 
Information Frequency Codes 
Forgery 1 1010 
Larceny 1 1040 
Reocol Pro il 1041 
Theft 4 1043 
Sus Theft i 1044 
Assault i 1050 
Lied 3 107A 
Wild (Beh) 1 TOs 
Alcohol Pr 6 1100 
DUI 1 ik TAO 
Liq Law Vi if 1104 
Min Ale wero 1 TOS 
Piubintes 1 2 1106 
Marij 1-5 4 al 
Susp Drug Z 1140 
DrgUse 1-5 Z 1141 
Bad Checks 3 i230) 
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No Restit 1 oil 
Unpd Bills 3 1233 
ContDelMin 1 1410 
Un Absense 1 1501 
Non HS Grd i) 15nht 
Not El Reh 3 1537 
Perf Unsat 2 1536 
Fired 5 1561 
Freq Quit fF 1562 
Absnt Freq 5 1570 
Braggart i 13 
Immat Beha Z. sys) 
Tardy il 1577 
Undependb1 Z 1578 
SusCommSym 1 1600 
Emot Prob i! 1701 
Reck Drive 1 1805 
Mov Veh 1 6 1810 
Mov Veh2-5 4 1811 
Disord Con 2 1821 
Harrassmnt 1 1623 
Il Firearm 1 1824 
Misdemeanr 1 1627 
Out Warrnt 1 1828 
Juven Rec 2 1843 
Mal Mischf 1 1844 
Job NotSPH 1 1923 
Empl False 3 1924 
Time Unact Z 1927 
Variable AY4, Summary of BI #4 
Information Frequency Codes 
Theft 3 1043 
Sus Theft i 1044 
Paeconol Pr 1 1100 
P@pintox 1 i 1106 
Marij 1-5 2 miial 
Memes) 6-25 1 Zz 
DrgUse 1-5 3 1141 
Not El Reh iL L537 
Fired 2 {561 
Suspnd Sch 1 1564 
Absnt Freq 2 1570 
Immat Beha 8 157 3 
Reckl Cond 1 1576 
Tardy i 1577 
Undependb1l 3 1578 
Emot Stab? i 1702 
Nerv Cond i 1 
Leave Acct 1 1804 
Reck Drive 1 1805 
Mov Veh 1 Z 1810 
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Mov Veh2-5 1 ilfevalea) 
Dstb Peace 1] 822 
Empl False 1 1924 


No Clrance 3 0 
Confident. 1 1 
Secret 105 2 
Top Secret 141 3 
SCL 1 4 
Unknown 33 9 


Variable BF, Release Code 


Release Frequency Codes 
Pat Miscon 3 1 
DrgUse Tra 1 2 
DrgUse Pos 12 3 
Ser. Offen 2 4 
Pers Dis B 7 5 
Homos Acts 1 6 
Inv Dischg 44 7 
HonDisHomo 17 8 
Drug Abuse 74 9 
Min Dis in 2 10 
MisconSerO 14 11 
Homos Acts 30 12 
Homo State ie ie 
HonDisLesb 1 14 
Pers Dis 20 is 
Con Ct Mar 73 16 
HonDisUnsu 39 ly 
HonDisUnHo Z 18 
Vol Dischg 10 19 
Invol Rele 162 20 
HonDisOffi 33 aM 


Variable BGl, Military Offenses #1 


Information Frequency Codes 
Larceny 8 1040 
Theft 3 1043 
Assault 8 1050 
Lied 1 1071 
Alcohol Pr i 1100 
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Drnk & Dis 2 L035 
Mari) 1-5 44 gle? 
DrgUse 1-5 eZ 1141 
Sale Drugs 1 1150 
Bad Checks 1 1230 
Homo Admit 3 Vea 
Desertion 6 Isl Ole, 
Un Absense 120 HS OU 
Disrespect 6 1530 
Derel Duty 2 1531 
Fail Obey 25 1534 
Disenr Sch it 1540 
Com Threat 1 1820 
Piso mEdeCon 1 1821 
Harrassmnt i 1623 
Meaud Enli 1 1922 
No Der Inf 301 9999 


Variable BG2, Military Offenses #2 


Information Frequency Codes 
Larceny i 1040 
Theft 3 1043 
Assault 8 1050 
mieconol Pr > 1100 
Euointox 1 ik 1106 
iad] 1-5 33 Za 
Susp Drugs i 1140 
DrgUse 1-5 19 1141 
Sale Drugs 2 10 
Att Sodomy 2 1420 
Homo Admit 3 1421 
Desertion 34 1500 
Un Absense OZ 101 
Disrespect 4 130 
Derel Duty 2 les 
Fail Obey 24 1534 
Emot Stab? 1 2: 
Con Weapon i 1800 
Dstb Peace 1h ez 
Dmg Per Pr 1 1841 


Variable BG3, Military Offenses #3 


Information Frequency Codes 
Larceny 3 1040 
Theft Ms 1043 
Assault 6 1050 
frconol Pr 5 1100 
Drnk & Dis 1 1103 
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Marij 1-5 13 eA 


Susp Drugs il 1140 
DiroUseak-5 5 1141 
Homo Admit | tan 
Indec Expo 1 1431 
Desertion 10 1500 
Un Absense 10 1501 
AGeicd, Pr 5 15380 
Derel Duty 3 As) s' 
Fail Obey 11 1534 
Suicide At 1 1704 
Com Threat 1 1820 


Variable BG4, Military Offenses #4 


Information Frequency Codes 
Larceny 1 1040 
Theft 1 1043 
Assault Zz 1050 
DrgUse 1-5 c) 1141 
Bad Checks i 1230 
Att Sodomy 1 1420 
Attitd Pr 1 1530 
Fail Obey 1 1534 


Variable BH1, Remarks/Discharge 


Remarks Frequency Codes 
KeoLelELoDp 1 1041 
Assault i! 1050 
Rape 1 152 
AleonoL Pr Z 1100 
Do Z 1101 
V0 a i ee a) i) bi) 
DrgUse 1-5 5 ia sah 
Drug Waivr 46 1160 
Homo Admit 2 NA Ih 
Fail Obey iL 1534 
Disenr Sch 39 1540 
Hard Disch 1 ay 
Human Reas i 1542 
Decert PRP i i510 
Disqul Nuc yi 1s Syl 
Emot Stab? il 1702 
Suicide At 2 1704 
Disord Con 1 e220) 
Fraud Enl eal 19272 
No Svc Rec 438 9999 
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Variable BH2, Remarks/Discharge 


Remarks Frequency Codes 
Burglary 1 1001 
Larceny 1 1040 
Lied 1 1071 
meeonol Pr 2 1100 
ieatay 1-5 il 11 
DrgUse 1-5 8 1141 
Sale Drugs 1 1150 
Homo Admit 3 Le 2) 
Un Absense 1 1501 
Disenr Sch y 1540 
Decert PRP 6 1550 
Disqul Nuc 5 sisal 
Lack Motiv 3 1563 
Suicide At 1 1704 
Disord Con 1 Te2 4 
Fraud Enli 2 nOZ2 


Variable BH3, Remarks/Discharge 


Remarks Frequency Codes 
Sale Drugs 1 1050 
pioeitox 1 | 1106 
DrgUse 1-5 2 1141 
Un Absense 1 5 Ol 
Disenr Sch if 1540 
Decert PRP 2 1550 
ie) Nuc i ILS Syl 
Harrassmnt 1 ie 3 


Variable BH4, Remarks/Discharge 


Remarks Frequency Codes 


Variable BL, Status of 5520/20 


Status Frequency Codes 
No Rec BI 82 90 
Rec Not Fd 70 ches 
Blank 412 99 
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Variable BM, Discharge Case Codes 


Case Codes Frequency Codes 
BI-Clear 324 101 
SBI-Clear als) 102 
IBI-Clear 35 103 
Bi sud tap. 74 bs 
SB -suatabp 2 Ize 


Variable BO, Interservice Seperation Code 


Seperation Codes Frequency Codes 


Unknown 10 0 
Bus 195 1 
Char/Beh D 65 60 
Discred In 50 65 
Drugs 86 67 
Court Mart TS 7 
Homosexual 70 76 
Serious Of ie 84 


Variable BP, Type of Discharge 


Type Frequency Codes 
Honorable 328 il 
General 10 Zz 
Oth Th Hon 226 5 


Variable BQ, Character of Service 


Character Frequency Codes 
Good 205 0 
Homosexual 70 } 
Misconduct 65 2 
Drug/Alcoh 86 3 
Coure, Mart 73 4 
Char/Beh D 65 5 
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APPENDIX B. FOUR-DIGIT DEROGATORY INFORMATION CODES 


The codes contained in this appendix are those used in the data base. 


CODE Behavior 

1000 Breaking and entering 

1001 Burglary 

1010 Forgery 

1040 Larceny 

1041 Received stolen property 

1042 Robbery 

1043 Theft, grand or petty 

1044 Theft suspected 

1050 Assault 

HOS 1 Battery 

TOS 2 Rape 

1060 Dishonest 

1070 Incorrigible 

Tay 1 Lied 

1090 Violent 

1091 Wild (behavior) 

1100 Alcohol problems 

1101 Driving under the influence, 1 time 

HNO 2 Driving under the influence, more than 1 time 
1103 Drunk and disorderly 

1104 Liquor law violation 

£105 Minor in possession of alcohol 

1106 Public Intoxication, 1 time 

MOT Public Intoxication, more than 1 time 

1120 Marijuana arrest 

eZ A. Marijuana use 1 to 5 times 

ez 2 Marijuana use 6 to 25 times 

1123 Marijuana use more than 25 times 

1140 Drug use suspected 

1141 Drug use (cocaine, heroin, etc.) 1 to 5 times 
1142 Drug use (cocaine, heroin, etc.) 6 to 25 times 
1143 Drug use (cocaine, heroin, etc.) more than 25 times 
1150 Sale of drugs 

1160 Drug Waiver 

2 oO Credit 

1230 Bad checks 

1231 Restitution not made 

2 o2 Sued for non-payment of account 

1233 Unpaid bills or accounts 

1234 Unpaid taxes 

205 Vehicle repossessed 

1250 Bankrupcy 

1400 Do anything for money 

1410 Contributing to the delinquency of a minor 
1420 Attempted Sodomy 

1421 Admitted homosexual 
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1422 


Suspected homosexual 

Juvenile homosexual behavior 
Dressed as a female 

Indecent Exposure 

Desertion 

Unauthorized absense 

Violation of parole 

General Education Degree (GED) 
Did not graduate from high school 
Attitude Problem 

Disrespect (same code as above) 
Dereliction of duty 
Disagreements with employer 
Employment problem 

Failure to obey a lawful order 
Insubordinate 

Misconduct 

Not eligible for rehire 
Performance unsatisfactory 
Tresspass 

Disenrolled from school 

Hardship discharge required 
Humanitarian reassignment 
Training discharge 

Decertified from Personnel Reliability Program (PRP) 
Disqualified from the nuclear field 
Expelled 

Fired 

Frequently quit or left job 

Lack of motivation 

Suspended from school 

Absent frequently 

Braggart 

Failure to appear 

Immature behavior 

Lacked judgement 

Needs supervision 

Reckless conduct 

Tardy (school or job) 
Undependable 

Suspected communist sympathies 
Easily upset 

Emotional problems 

Mental or emotional stability questioned 
Psychological problem 

Suicide attempt 

Anxiety 

Introverted 

Nervous condition or unable to cope with stress 
Depression 

Concealed weapon 

Felony 

Fugitive 

Hit and run 

Leaving the scene of an accident 
Reckless driving 
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Vehicle violation, moving, l 


Vehicle violation, moving, 2 to 5 
Vehicle violation, moving, 6 or more 


Vehicle violation, standing, 
Vehicle violation, standing, 
Vehicle violation, standing, 
Communicate a threat 
Disorderly conduct 
Disturbing the peace 
Harrassment 

Illegal use of firearms 
Liscense suspended 
Loitering 

Misdemeanor 

Outstanding arrest warrant 
Vandalism 

Criminal mischief 

Damage to personal property 
Juvenile delinquent 
Juvenile record 

Malicious mischief 

Runaway 

Uncontrolled juvenile 
Voyeur or Peeping Tom 

Used alias 


Zz tors 
6 or more 


Address not listed on Statement of Personal History (SPH) 
Claimed to be a diploma graduate of high school 


Fraudulent reenlistment 
Jobs not listed on SPH 


Length of employment falsified 


Prior service concealed 


Social Security Number (SSN) falsified or altered 


Time unaccounted for 
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APPENDIX C. BOOTSTRAP RESIDUAL PROGRAM 


This appendix contains the program which produces an empirical estimate of the 
upper-quantile distribution of the maximum residuals produced by independent, cross- 


tabulated variables. 


PROGRAM BSRESD 
Yeredededeveseveskeseskesleseakesicskatscakak dks kak kakakakoesiedescakseakat desks akakeakeake sk ke akeseakseakakslesk akeaksesteslese se koksleseskskevesestesests 


* THIS PROGRAM CONDUCTS A MONTE-CARLO SIMULATION USING THE MULTI- * 
_ NOMIAL RANDOM NUMBER GENERATOR FROM THE IMSL LIBRARY. THE * 
* PURPOSE OF THE SIMULATION IS TO GENERATE A NUMBER OF REPLICATIONS* 
% OF POSSIBLE COUNTS OCCURING IN A CONTINGENCY TABLE WHEN THE ¥ 
* NULL HYPOTHESIS OF INDEPENDENCE, OR EQUAL PROBABILITY APPLIES. * 
* SPECIFIC ROW AND COLUMN COUNTS FROM CONTINGENCY TABLES, AND A ¥ 
* HEADING FOR IDENTIFICATION ARE READ FROM AN INPUT FILE. THE * 
¥ MULTINOMIAL RANDOM NUMBER GENERATOR PRODUCES SIMULATED COUNTS * 
* BASED UPON THE CELL PROBABILITIES PRODUCED FROM THE MARGINAL * 
¥ COUNTS. EACH REPLICATION OF THE SIMULATION IS TREATED AS A * 
* NEW CONTINGENCY TABLES AND THE STANDARDIZED RESIDUALS ARE COM- * 
¥ PUTED AS IN AN ACTUAL CONTINGENCY TABLE. THE LARGEST ABSOLUTE ¥ 
* RESIDUAL FOR EACH REPLICATION IS SAVED AND STATISTICAL * 
ve INFORMATION ABOUT THE LARGEST ABSOLUTE RESIDUAL PRODUCED BY * 
* INDEPENDENT VARIABLES IS COMPUTED. * 


Yervovedededesevedkseveskevetesksedeak ede k kevedevededkesdeved skewed sea dea kak vedseske dk akseseoeselcscceseseskseaksesdksevece skate tek vesk de 


CHARACTER*79 TBLNAM 

REAL MXRESD( 200) ,ORDMRE( 200) ,RESID( 200,70) ,MNRSD,P( 70) ,E( 70) 
INTEGER ROWNUM,COLNUM , ROWCNT( 20) , COLCNT( 20) , TOTAL, RSLT( 200,70), 
X CELLS 


*ESTABLISHES THE INPUT/OUTPUT FILES 
CALL EXCMS( 'FILEDEF 01 DISK BSRESID DATA A’) 
CALL EXCMS('FILEDEF 02 DISK BSRESID OUTPUT A’) 


NR = 200 
M=1 


*SETS THE SEED FOR THE RANDOM NUMBER GENERATOR 
ISEED = 54821 
CALL RNSET( ISEED) 


152 FORMAT(1X,'STATISTICS OF THE MAX RESIDUALS DERIVED FROM A ‘, 
X  'BOOTSTRAP'/1X,'SIMULATION OF CROSSTABULATED INDEPENDENT ', 
X ‘CATEGORICAL VARIABLES. '//) 
WRITE(2, 152) 


99 READ( 1, 100,END=999)TBLNAM 

100  FORMAT(A) 
READ( 1, 110)ROWNUM, COLNUM 

110 FORMAT(215) 
READ(1,120)(ROWCNT( 1), I=1,ROWNUM) 
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READ( 1,120) (COLCNT( I) , l=1,COLNUM) 
120 FORMAT( 2014) 


HOTAL = 0 
DO 10 I=1,COLNUM 
TOTAL = TOTAL + COLCNT(I) 
10 CONTINUE 


SEolABLISHES THE EXPECTED CELL SIZE AND THE PROBABILITY OF A COUNT 
*FALLING IN EACH CELL 


PTOT = 0. 
K=1 
DO 20 I=1,ROWNUM 

DO 20 J=1,COLNUM 


Bere —= (lO * ROWGNTC 1) * COLCNT(J)) / (1.0 * TOTAL) 
PGK) = ECR) ./. TOTAL 

riote— Prior + PUK) 

K=K+1 


20 CONTINUE 
CELLS = ROWNUM * COLNUM 
*ERROR TRAP IN CASE ROUNDING ERROR CAUSES PROBLEMS 
IF (ABS(1.0 - PTOT) .GT.002) THEN 
WRITE(2,*) ' LARGE DISCREPANCY ' 
GO TO 99 
END IF 
IF (PTOT .NE. 1.0) THEN 
P(CELLS) = P(CELLS) + (1.0 - PTOT) 
END IF 
*CALLS THE MULTINOMIAL RANDOM NUMBER GENERATOR 
CALL RNMTN(NR, TOTAL, CELLS ,P,RSLT,200) 
*COMPUTES THE RESIDUALS FOR EACH REPLICATION AND EACH CELL 
DO 30 I=1,NR 
MXRESD(I) = O. 
DO 30 J=1,CELLS 
Rev cl 0) —eRSLICI.) = EC J)ede/ -GoSORTC. ECJ)..)..) 
IF( ABS( RESID(I,J) ) .GT. MXRESD(I) ) 
X MXRESD(I) = ABS( RESID(I,J) ) 
30 CONTINUE 
*USES A BUBBLE SORT ROUTINE TO ARRANGE FROM SMALLEST TO LARGEST 
CALL BUBBLE(MXRESD ,NR,ORDMRE) 
*FORMATTED OUTPUT FOR OUTPUT FILE 


WRITE(2,101)TBLNAM 
101 FORMAT (1X,A) 


is 


WRITE(2,151)M,ROWNUM , COLNUM 


151 FORMAT(1X,'RECORD NUMBER = °,13/1X, A ,12, "BYs 9025 ecu, 
X 1X,'WITH THE FOLLOWING EXPECTED CELL SIZES: '/) 
kK = 20 


DO 35 I=1,ROWNUM 
WRITE(2,153)(E(J) , J=K+1, K+COLNUM) 
K = K + COLNUM 

35 CONTINUE 

153. FORMAT(15(1X,F5.1)) 


*WRITES OUT THE STATISTICS 


MNSRD = 0. 
DO 50 I=1,NR 
MNRSD = ORDMRE( I) + MNRSD 
50 CONTINUE 


MNRSD = MNRSD / NR 


DO 60 I=1,NR 
VARRSD = (ORDMREC( I) - MNRSD)**2 
60 CONTINUE 


VARRSD = VARRSD / NR 
SDRSD = SQRT (VARRSD) 


WRITE( 2,160) 


160  FORMAT(/T4&)'MEAN' ,TI8,' VARIANCE ,T249'STD DEV ,7T35).95 QUA @ = 
X  1T48,'.99 QUANT’ ,T61,'LARGEST VALUE’) 


WRITE(2,161)MNRSD,VARRSD, SDRSD, ORDMRE( INT(. 95*NR)) , ORDMRE( INT(. 99% 
X | NR)),ORDMRE(NR) 
161  FORMAT(T4,F6.4,713,F6. 4,T24,F6.4,T35,F6.4,T48,F6.4,T61,F6. 4//) 


COnTO ys 
ele)s, SLOP 
END 


SUBROUTINE BUBBLE( ARG ,N,RSLT) 
*USES THE BUBBLE SORT ALGORITHM TO ARRANGE A VECTOR OF VALUES, ARG 
REAL ARG(N),TEMP,RSLT(N) 


DO 10 I=1,N-1 
DOM ON Nei teen 

IF (ARG(J). LT. ARG(J-1)) THEN 
TEMP = ARG(J) 
NRC) SNS 
ARG(J-1) = TEMP 

END IF 

10 CONTINUE 


DO 20 I=1,N 
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APPENDIX D. INPUT FILE FOR THE BOOTSTRAP SIMULATION 
PROGRAM 


This appendix contains the input for each contingency table. Each table is repres- 
ented by four lines. Line 1 1s the title: line 2 is the number of rows and then the number 


of columns; line 3 1s the row marginal counts; line 4 1s the column marginal counts. 


RACE VS. CHARACTER OF SERVICE 
4 6 
397 50 13 104 
205. 7/0 G5mmeo. Jo 08> 
SUBJECTS WITH AN INCIDENT OF MAJOR CRIME VS. CHARACTER OF SERVICE 
2 6 
444 120 
205. 7/0365 S 7 oamo> 
SUBJECTS WITH ADJUSTMENT INCIDENTS DISCLOSED IN INV VS. CHAR OF SVC 
yi 6 
330) 228 
205 “FUR 65 GOmE 2 be 
MARITAL STATUS VS. CHARACTER OF SERVICE 
3 6 
517 S24 
205° 27 CRG Sco my oon 
SUBJECT SEX VS. CHARACTER OF SERVICE 
2 6 
479 = 85 
205. “FOR 65 SOC 2 amo® 
HIGH SCHOOL DIPLOMA BEARER VS. CHARACTER OF SERVICE 
2 6 
454 110 
205° LOM O50 oO” 751265 
CATEGORY OF MOST SERIOUS DRUG USE VS. CHARACTER OF SERVICE 
3 6 
418 105 41 
205. “FOS 65 BOO  72amoo 
DRUG/ALCOHOL ABUSE DISCLOSED IN INVESTIGATION VS. CHARACTER OF SERVICE 
2 6 
S)tol SPR Wins) 
205 7OMNOSmESG 735 765 
AMOUNT OF DRUG USE DISCLOSED IN INVESTIGATION VS. CHARACTER OF SERVICE 
4 6 
418 64 35 47 
205 70. 65 Gb 7 o ao 
DRUG USE DISCLOSED IN INVESTIGATION VS. CHARACTER OF SERVICE 
2 6 
418 146 
205 70. 65 56 We “65 
AGE AT ENLISTMENT VS. CHARACTER OF SERVICE 
4 6 
85 5820/6 as 
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20570 6586 73 65 
AGE AT ENLISTMENT VS. CHARACTER OF SERVICE 
3 6 
6565824097 
Olas 70 65° 86 73 65 
DEROGATORY INFO DISCLOSED IN INVESTIGATION VS. CHARACTER OF SERVICE 
2 2 
i>. 415 
205 359 
DEROGATORY INFORMATION DISCLOSED IN INVESTIGATION VS. CHAR. OF SERVICE 
Z 6 
151 413 
@U>, /0 65 86 73 65 
CHAR OF SVC VS. AMOUNT OF DEROG. INFORMATION DISCLOSED IN INVESTIGATION 
6 aL 
05870 65 86 73 65 
[emo semOG. 26° 25 28 32 26 25 29 79 
CHAR OF SVC VS. WEIGHTED RECOMMENDATION SCORE 
6 a 
Boe 70 65.86 73 65 
Mieel03 284 72 
CHAR OF SVC VS. RECOMMENDATION SCORE (ADJUSTED FOR NEGATIVE RECS. ) 
6 4 
Mee 70 65 86 73 65 
p25110 296 76 
CHAR OF SVC VS. NUMBER OF RECOMMENDATIONS DISCLOSED IN INVESTIGATION 
6 5 
gue 70 65 86 73 65 
62 84 314 67 36 
CHAR OF SVC VS. NUMBER OF RECOMMENDATIONS DISCLOSED IN INVESTIGATION 
6 4 


Bee 70 65 86 73 65 
62 84 314 104 
CHAR OF SVC VS. NUMBER OF NOT RECOMMENDED DISCLOSED IN INVESTIGATION 
6 Z 
Oe 70 65 86 73 65 
489 75 
ADMITTED DRUG USE (SPH) VS. DRUG USE DISCLOSED BY LOCAL AGENCY CHECK 
a) Z 
504 60 
510 54 
INFO OBTAINED FROM INTERVIEW (YES OR NO) VS. REASON FOR INTERVIEW 
2 - 
yo 195 
fois 118 7/6 
AMOUNT OF INFO OBTAINED FROM SUBJECT INTERVIEW VS. REASON FOR INTERVIEW 
5 a 
gee 72 66 33 24 
Woeel> 118 76 
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APPENDIX E. OUTPUT FROM THE BOOTSTRAP SIMULATION OF 
THE LARGEST RESIDUAL 


This appendix contains the output from the FORTRAN program which extracted 


the maximum residual from each replication of independent cross-tabulated variables. 


STATISTICS OF THE MAX RESIDUALS DERIVED FROM A BOOTSTRAP 
SIMULATION OF CROSSTABULATED INDEPENDENT CATEGORICAL VARIABLES. 


RACE VS. CHARACTER OF SERVICE 

RECORD NUMBER = 1 

A 4& BY 6 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


144.3 49.3 45.8 60.5 51.4 45.8 
1652 6,255.08 ta6 G5 58 
4.7 ee) Le Za) ee: ee, 
3/.. 8 li) 12, 0) ll ele 
MEAN VARIANCE STD DEV 95 QUANT .99 QUANT LARGEST VALUE 
2 oer C026 0.1120 Page 0 1. 3.6441 3. 7455 


SUBJECTS WITH AN INCIDENT OF MAJOR CRIME VS. CHARACTER OF SERVICE 
RECORD NUMBER = 1 

A 2 BY) 6 TeBEE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


161.4): 55 feo 2. "Oe 7 Se ole 
by 3 sO pemieGe . eeaLS os Bee Lec ellis as 


MEAN VARIANCE STD DEV ~95 QUANT .99 QUANT LARGEST VALUE 
ils eNOS 0. 0236 Onl 36 2. 8432 Sone 4.0793 


SUBJECTS WITH ADJUSTMENT INCIDENTS DISCLOSED IN INV VS. CHAR OF SVC 
RECORD NUMBER = i 

A 2 BY 6 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


122. 1296 See vs 2 oS oo 
62.9 «28:37 226530) 35 bec eo. 


MEAN VARIANCE STD DEV ~95 QUANT .99 QUANT LARGEST VALUE 
1. 8898 OO 1S 0.1070 2 On tal 5.1063 3. 4036 


MARITAL STATUS VS. CHARACTER OF SERVICE 
RECORD NUMBER = 1 
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A 3 BY 6 TABLE 
WITH THE FOLLOWING EXPECTED CELL SIZES: 


ovemoe 64. 2 959.6. ./8.8 66.9 59.6 

L250 4.1 3.65 Soil’ 4.3 beso’ 
De 1 ie? ie ies Ties) ne 
MEAN VARIANCE STD DEV -95 QUANT - 99 QUANT LARGEST VALUE 
20199 O-210 552 Cag 22 Ge RCo} ee) SES, 4.5982 


SUBJECT SEX VS. CHARACTER OF SERVICE 
RECORD NUMBER = i 

Pee 2 BY 6 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


eee oo. > 55.2 73.0 62.0 55.2 
eee? 10.5 geo oe 1S: Ole). 0 228 


MEAN VARIANCE STD DEV aoa QUANT .99 QUANT LARGEST VALUE 
1. 8686 Os Oem Or 1633 TESS 30 1 3 4.1773 


HIGH SCHOOL DIPLOMA BEARER VS. CHARACTER OF SERVICE 
RECORD NUMBER = 1 

A 2 BY 6 TABLE 

ween THE FOLLOWING EXPECTED CELL SIZES: 


Moe oO. 3 «2.3 «669.2 58.8 52.5 
Meme, 13.7 12.7 16.8 14.2 12.7 


MEAN VARIANCE STD DEV » 95 QUANT .99 QUANT LARGEST VALUE 
1. 8848 0.40257 O. 1604 ZeOGe/ 3. 1658 4.1537 


CATEGORY OF MOST SERIOUS DRUG USE VS. CHARACTER OF SERVICE 
RECORD NUMBER = I 

fue y «6426 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


ees (Ol. «8448.2 63.7 54.1 48.2 
Someeto. O° 12.1 16.07 13.6 12.1 
14.9 Del 4.7 cae, See, 4.7 


MEAN VARIANCE STD DEV ~95 QUANT .99 QUANT LARGEST VALUE 
2.1042 0.0170 D305 Ze US 3. 7080 3.9504 


DRUG/ALCOHOL ABUSE DISCLOSED IN INVESTIGATION VS. CHARACTER OF SERVICE 
RECORD NUMBER = 1 

A 2 BY 6 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


141.4 48.3 44.8 59.3 50.3 44.8 
yo fie, «20,2 26.7 22.7 20.2 


rue 


MEAN VARIANCE STD DEV - 95 QUANT, »99 QUANT LARGEST VALUE 
1. 8342 OO MLS Oe ele) tl 2.5641 2.9649 3. 3349 


AMOUNT OF DRUG USE DISCLOSED IN INVESTIGATION VS. CHARACTER OF SERVICE 
RECORD NUMBER = 1 

A 4 BY 6 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


151.9 5159 “48.2 63577 9 54. 1 ese 

255 oo 7.4 oc ol, 7.4 

2a 4.3 4.0 Se: 4.5 4.0 

yee dl Sires) 5.4 fine 2), Il 5.4 
MEAN VARIANCE STD DEV -95 QUANT .99 QUANT LARGEST VALUE 
2.2402 0.0370 Onl22> 3. FOZ ye 28 4.9623 


DRUG USE DISCLOSED IN INVESTIGATION VS. CHARACTER OF SERVICE 
RECORD NUMBER = 1 

A 2 BY 6 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


151.9 5159 48.2 637570 oe eee coe 
53,1) 16el “10.0 9223 G ee ne 


MEAN VARIANCE STD DEV -95 QUANT .99 QUANT LARGEST VALUE 
1. 8495 0.0471 Oa217 0 2. 5684 3.1447 4.9181 


AGE AT ENLISTMENT VS. CHARACTER OF SERVICE 
RECORD NUMBER = 1 

A 4 BY 6 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


30 o seu Oe 9.6 13405, Teo 925 

138.8 47.4 44.0 58.2 49.4 44.0 

28.4 ee o> 1 See 7.0 
OY 2.4 Ze Zag ae 2 
MEAN VARIANCE STD DEV -95 QUANT . 9920UANT LARGEST VALUE 
PAW sy: OeOL12 0. 1032 3.0226 B. 5557 3.6740 


AGE AT ENLISTMENT VS. CHARACTER OF SERVICE 
RECORD NUMBER = iL 

Alo BY 65 ba BLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


30. 9 eG Soc aa les a Omi ies. 8, 22.0 

138.8 47.4 44.0 58.2 49.4 44.0 

99.9 12.0 Tie” 147 Cue ollie 
MEAN VARIANCE STD DEV ~95 QUANT 299° QUANT LARGEST VALUE 
2.0881 0.0164 Ooze 2.9407 SP ollc 3. 6992 


104 


DEROGATORY INFO DISCLOSED IN INVESTIGATION VS. CHARACTER OF SERVICE 
RECORD NUMBER = 1 

A 2 BY 2 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


pee? 96. 1 

io, 262.9 
MEAN VARIANCE STD DEV -95 QUANT .99 QUANT LARGEST VALUE 
1.1540 Omoll7 0.1082 1. 8866 2. 3570 2.6841 


DEROGATORY INFORMATION DISCLOSED IN INVESTIGATION VS. CHAR. OF SERVICE 
RECORD NUMBER = 1 

eae BY 6 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


mes 16.7 17-4 23.0 %19.5 17.4 
ree 51.3 47.6 63.0 53.5 47.6 


MEAN VARIANCE STD DEV -95 QUANT -99 QUANT LARGEST VALUE 
1.9345 G0 163 Oo1276 2.9014 3. 4961 3. /3g0 


CHAR OF SVC VS. AMOUNT OF DEROG. INFORMATION DISCLOSED IN INVESTIGATION 
RECORD NUMBER = 1 

A 6 BY 11 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


mo 623.6 €624.7 «(13.1 oe Ome 1. 6 as a. lO. 2ome 
18.7 8.1 8.4 4,5 Ore bea. 4.0 She eal 306 9..6 
17.4 1a) 7 4.1 pee) ee a7 320 Zoe Sine) gi) 
Za. O 92910. 4 DiS 38 4.3 4.9 4.0 aS, 4.4 12.0 
LS 8.4 8.8 4.7 oh 7 Sale 4.1 3.4 oo Semmes 2 
17.4 i. 7.8 4.1 ie eae Diao, 3.0) Zao ee) Jan 
MEAN VARIANCE STD DEV ~95 QUANT poo QUAND LARGEST VALUE 
2.7242 On 0226 Oral 502Z 3.7704 4.6236 4.8490 
CHAR OF SVC VS. WEIGHTED RECOMMENDATION SCORE 
RECORD NUMBER = 1 
A 6 BY 4 TABLE 
WITH THE FOLLOWING EXPECTED CELL SIZES: 
eee2 37.4 103.2 26.2 
iereo 612.8 35.2 8.9 
ier 11.9 32.7 B05 
Moo loa? 43.3 11.0 
iro 13.3 36.8 oe, 
rt 11.9 32.7 8. 3 
MEAN VARIANCE STD DEV ~95 QUANT .99 QUANT LARGEST VALUE 
eee o2s 020235 Ola Z6 S.0205 ed ia les’ 4.3912 
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CHAR OF SVC VS. RECOMMENDATION SCORE (ADJUSTED FOR NEGATIVE RECS. ) 
RECORD NUMBER = I 

A 6 BY 4 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


29.8 40-0 807.6 2756 

LO..20 Ws aoe. 7 9.4 

Se) (dao G.I 8.8 

12.5 lOee. 745. 1 Pins 

1086 14252 38.3 7.8 

9.5  lZay - 3454 cas) 

MEAN VARIANCE ST DeDEV ~95 QUANT .99 QUANT LARGEST VALUE 
Zs 3 7 O. Omw5 QO. Taleo 2. 92a 3.4407 347065 


CHAR OF SVC VS. NUMBER OF RECOMMENDATIONS DISCLOSED IN INVESTIGATION 
RECORD NUMBER = 1 

A 6 BY 5 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


22:6 S0mo 114.3 245emee. 1 

7.47 “10 2439), 0 623 4.5 

joe 927 > 3053 ssl 4,2 

Gyo tea GSe0 WZ See. 

8.0 10.9 40.7 Sa 4.7 

hee gf Boos lg 4.2 

MEAN VARIANCE STD DEV 295. QUANT .99 QUANT LARGEST VALUE 
Zoe ol On029 1 0. 1706 Seo 4.0515 4.7361 


CHAR OF SVC VS. NUMBER OF RECOMMENDATIONS DISCLOSED IN INVESTIGATION 
RECORD NUMBER = iL 

A 6 BY 4 TABLE 

WITH THE FOLLOWING EXPECIED CELEB. (Zs: 


22,5 3O0RS 114, boise 

Tod ~ \O.°45eRo9. 0 Ze 

raul 9.7 domes 12./0 

91. AZ Ome. 9 «158 

8 O07 1009 geeO. 6 1355 

ima 9) Orca 1) 

MEAN VARIANCE STD DEV » 95 QUANT .99 QUANT LARGEST VALUE 
ZOA945 OF0n7 2 Oona 3.1814 3.4176 4.0479 


CHAR OF SVC VS. NUMBER OF NOT RECOMMENDED DISCLOSED IN INVESTIGATION 
RECORD NUMBER = 1 

A 6 BY 2 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


li jee 
6027 rae) 
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MEAN VARIANCE STD DEV se QUANT .99 QUANT LARGEST VALUE 
ao O 0 O61 Orel 7 1 2.8423 3. 3049 2) /alesy4 


ADMITTED DRUG USE (SPH) VS. DRUG USE DISCLOSED BY LOCAL AGENCY CHECK 
RECORD NUMBER = il 

A 2 BY 2 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


455.7 48.3 

5S ra 
MEAN VARIANCE STD DEV ~95 QUANT .99 QUANT LARGEST VALUE 
Ilyas) OF Ue0a27 UeLoog Ze 6 ll 52027 1 3ago15 


INFO OBTAINED FROM INTERVIEW (YES OR NO) VS. REASON FOR INTERVIEW 
RECORD NUMBER = 1 

A 2 BY 4 TABLE 

WITH THE FOLLOWING EXPECTED CELL SIZES: 


ZS: 5 4.8 38.1 24.5 
meee 10.2 °° 79.9 351.5 


MEAN VARIANCE STD DEV -95 QUANT .99 QUANT LARGEST VALUE 
05 33 0. 0108 Oia Onl 2.4613 7267 fp ay 3. 0888 


AMOUNT OF INFO OBTAINED FROM SUBJECT INTERVIEW VS. REASON FOR INTERVIEW 
RECORD NUMBER = 1 

A 5 BY 4 TABLE 

meee THE FOLLOWING EAPECTED CELL SIZES: 


25. 5 4.8 38.1 24.5 

i. 5 5) as cisco mummies @ 

ee 1 meee 270) 17.4 

Saal Mind? 1105 Cag 

6.6 eo Dice 62-5 

MEAN VARIANCE STD DEV woo QUANT 7 220UEND LARGEST VALUE 
2.1487 0.0349 0. 1868 Be22/ 5 S597 U2 4.7911 
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